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1. Introduction 

If {Xi)i^z is a real-valued stationary process, then from a second-order inference point of view it is charac- 
terized by its mean /i = EX^ and the autocovariance function 7^ = E[(Xo — /i)(Xfe — /i)], G Z. Assume 
fi — 0. Given observations Xi, . . . ,X„, the natural estimates of and the autocorrelation — jk/jo are 



7fc = (1/n) ^ Xj_|fe|Xj and rfe = 7^/70, 1 - n < fc < n - 1, 

i=|fc|+l 



(1) 



respectively. The estimator 7^. plays a crucial role in almost every aspect of time series analysis. It is well- 
known that for linear processes with independent and identically distributed (iid) innovations, under suitable 
conditions, ^Jnijk — 7^) A/'(0,r|), where =4> stands for convergence in distribution, A/'(0, r^) denotes the 
normal distribution with mean zero and variance r^. Here t| can be calculated by Bartlett's formula (see 



Section 7.2 of Brockwell and Davis (1991 1). Other contributions on linear processes include Hannan and 



Heyde (1972), Hosoya and Taniguchi (19821, Anderson (19911 and Phillips and Solo (1992) etc. Romano 



and Thombs (1996) and Wu (2009) considered the asymptotic normality of % for nonlinear processes. As a 



primary goal of the paper, we shall study asymptotic properties of the quadratic (or L'^) and the maximum 
(or C°°) deviations of ^k- 



1.1. The £2 Theory 

Testing for serial correlation has been extensively studied in both statistics and econometrics, and it is a 



standard diagnostic procedure after a model is fitted to a time series. Classical procedures include Durbin 



and Watson 


( 


1950 


1951 





portmanteau test uses Qk = n ^^^^ statistic, and rejects if it has in the upper tail of x|- 

distribution. An arguable deficiency of this test and many of its modified versions (for a review see for 



example Escanciano and Lobato (2009)) is that the number of lags K included in the test is held as a 



constant in the asymptotic theory. As commented by Robinson (1991): 



"...unless the statistics take account of sample autocorrelations at long lags there is always the possibility that 
relevant information is being neglected..." 

The problem is particularly relevant if practitioners have no prior information about the alternatives. The at- 
tempt of incorporating more lags emerged naturally in the spectral domain analysis; see among others [Durlauf] 



(1991), Hong (1996) and 



Deo 



(2000). The normalized spectral density /(w) — (27r)^^ ^^.^^ cos(fca;) should 
equal to {2Tr)~^ when the serial correlation is not present. Let /(w) — X]fc=i-ri ^''{k/sn)fk cos(fcw) be the lag- 
window estimate of the normalized spectral density, where h{-) is a kernel function and s„ is the bandwidth 
satisfying the natural condition s„ — >■ cx) and s„/n — ?■ 0. The former aims to include correlations at large lags. 
A test for the serial correlation can be obtained by comparing / and the constant function f{uj) = {2tt)^^ 
using a suitable metric. In particular, using the quadratic metric and rectangle kernel, the resulting test 



statistic is the Box-Pierce statistic with unbounded lags. Hong (19961 established the following result: 

1 ' ° 



(2) 



under the condition that Xi are iid, which implies that all in the preceding equation are zero. |Lee and] 



Hong (20011 and Duchesne, Li and Vandermeerschen (2010) studied similar tests in spectral domain, but 



using a wavelet basis instead of trigonometric polynomials in estimating the spectral density and henceforth 

prol 

adapative Neyman test and thresholding tests, using Taaxi<:k<s„iQk 



working on wavelet coefiicients. Fan ( 1996 ) considered a similar problem in a different context and proposed 

k)/V2k and nJ2h"=i ^fe-^(l^fc| > ^) as 



test statistics respectively, where (5 is a threshold value. Escanciano and Lobato (2009) proposed to use Qs 
with s„ being selected by AIC or BIC. 



It has been an important and difficult question on whether the iid assumption in Hong ( 1996 1 can be 

relaxed. Similar problems have been studied by Durlauf (1991), Deo (2000) and [Hong and Lee ( |2003 ) for 

the case that Xi are martingale differences. Recently Shao (2011) showed that ^ is true when (Xi) is a 

general white noise sequence, under the geometric moment contraction (GMC) condition. Since the GMC 

condition, which implies that the autocovariances decay geometrically, is quite strong, the question arises 

2 



as to whether it can be replaced by a weaker one. Furthermore, one may naturally ask: what if the serial 
correlation is present in ([2|? To the best of our knowledge, there has been no results in the literature for 
this problem. This paper shall address these questions and substantially generalizes earlier results. We shall 
prove that ([2| remains true even if all or some of are not zero, but the variance of the limiting distribution, 
being different, will depend on the values of r^. Furthermore, we derive the limiting distribution of X]fc=i 
when the serial correlation is present. The latter result enables us to calculate the asymptotic power of the 
Box-Pierce test with unbounded lags. 



1.2. The C°° Theory 



Another natural omnibus choice is to use the maximum autocorrelation as the test statistic. Wu (2009) 
obtained a stochastic upper bound for 



'n max - lk\, 



(3) 



and argued that in certain situations the test based on ([3| has a higher power over the Box-Pierce tests 
with unbounded lags in detecting weak serial correlation. It turns out that the uniform convergence of 
autocovariances is also closely related to the estimation of orders of ARMA processes or linear systems in 
general. The pioneer works in this direction were given by E. J. Hannan and his collaborators, see for example 



Hannan (1974) and An, Chen and Hannan (1982 1. For a summary of these works we recommend (Hannan 



and Deistler[ 1988 Section §5.3) and references therein. In particular. An, Chen and Hannan ( 1982 ) showed 



that if s„ ~ 0[(logn)"] for some a < oo, then with probability one 

O (log log n) . 



/n max {-jk - lk\ 

l<k<Sn 



(4) 



The question of deriving the asymptotic distribution of (|3| is more challenging. Although Wu ( 2009 ) was 
not able to obtain the limiting distribution of ([s]), his work provided important insights into this problem. 
Assuming fc„ — >■ oo, fc„/n — > and h > 0, he showed that, for Tk = \/n{'^k — ^Ik), 



{Tk„ , Tk^+hY N 





f (To 


crh\ 













, where cr/i = ^7fc7fc+/i, 



(5) 



kei 



and we use the superscript T to denote the transpose of a vector or a matrix. The asymptotic distribution 
in ([5]) does not depend on the speed of fc„ — > 00. It suggests that, at large lags, the covariance structure of 
{Tk) is asymptotically equivalent to that of the Gaussian sequence 



where ry^'s are iid standard normal random variables. Define the sequences (a„) and (5„) as 

a„ = (21ogn)-i/2 and bn = i2logny/^ - {8\ogn)-^/^(\og\ogn + logA'K). 
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(6) 



(7) 



According to Berman (1964) (also see Remarks [s] and |4]) , under the condition lini„^oo E(GoG„) log7i = 0, 

= exp{— exp(— x)}. 



lim P max |Gi| < y/ao{a2s x + 62s 

" ' "~ , l<i<s 



Therefore, Wu (2009) conjectured that under suitable conditions, one has the Gumbel convergence 



lim P ( ^max \Tk\ < \/(To(a2s„ x + 52s„) ) = exp{- exp(-a;)}. 



(8) 



In a recent work, Jirak (2011) proved this conjecture for linear processes and for s„ growing with at most 



logarithmic speed. We shall prove ([s]) in Section |4] for general stationary processes; and our result allows s„ 
to grow as s„ = 0{n^) for some < 77 < 1, and 77 can be arbitrarily close to 1 under appropriate moment 
and dependence conditions. The latter result substantially relaxes the severe restriction on the growth speed 
in (|4| and Jirak (2011) and, moreover, the obtained distributional convergence are more useful for statistical 
inference. For example, other than testing for serial correlation and estimating the order of a linear system, 
dsl) can also be used to construct simultaneous confidence intervals of autocovariances. 



1.3. Relations with the Random Matrix Theory 

In a companion paper, using the asymptotic theory of sample autocovariances developed in this paper. 



Xiao and Wu ( 2010 ) studied convergence properties of estimated covariance matrices which are obtained by 



banding or thresholding. Their bounds are analogs under the time series context to those of [Bickel and Levina| 
( |2008a|b[ ). There is an important difference between these two settings: we assume that only one realization 
is available, while Bickel and Levina ( 2008a|b" ) require multiple iid copies of the underlying random vector. 

There has been some related works in the random matrix theory literature that are similar to ([8| . Suppose 
one has n iid copies of a p-dimensional random vector, forming apxn data matrix X. Let fij, 1 < i, j < p, be 



the sample correlations. Jiang ( 2004 ) showed that the limiting distribution of maxi 



<i<j<p Fiji I 



after suitable 



normalization, is Gumbel provided that each column of X consists of p iid entries and each entry has finite 
moment of some order higher than 30, and p/n converges to some constant. His work was followed and 



improved by Zhou (2007) and Liu, Lin and Shao (20081. In a recent article, Cai and Jiang (2010) extended 



those results in two ways: (i) the dimension p could grow exponentially as the sample size n provided 
exponential moment conditions; and (ii) they showed that the test statistic max|i_j|>s^ |fy | also converges 
to the Gumbel distribution if each column of X is Gaussian and is s„-dependent. The latter generalization 
is important since it is one of the very few results that allow dependent entries. Their method is Poisson 



approximation (see for example Arratia, Goldstein and Gordon 1989), which heavily depends on the fact 



that for each sample correlation to be considered, the corresponding entries are independent. Schott (2005) 
proved that J2i 



<i<j<p ij 



converges to normal distribution after suitable normalization, under the conditions 



that each column of X contains iid Gaussian entries and p/n converges to some positive constant. His proof 
heavily depends on the normality assumption. Techniques developed in those papers are not applicable here 
since we have only one realization and the dependence structure among the entries can be quite complicated. 



1.4- A Summary of Results of the Paper 



We present the main results in Section [2] which include a central limit theory of ^ and the Gumbel 
convergence (|8]). The proofs are given in Section|4j In Section[5]we prove a normal comparison principle, which 
is of independent interest. Since summability conditions of joint cumulants are commonly used in time series 
analysis (see for example Brillinger (20011 and [Rosenblatt (1985)) and is needed in the proof of Theorem |4j 
we present a sufficient condition in Section |6] Some auxiliary lemmas are collected in Section [7j We also 
conduct a simulation study in Section |3j where we design a simulation-based block of blocks bootstrapping 
procedure that improves the finite-sample performance. 



2. Main Results 



To develop an asymptotic theory for time series, it is necessary to impose suitable measures of dependence 



and structural assumptions for the underlying process (Xi). Here we shall adopt the framework of Wu (2005 1. 
Assume that (Xi) is a stationary causal process of the form 

X, = g{- ■ ■ ,e,-i,ei), (9) 

where e, , z g Z, are iid random variables, and 5 is a measurable function for which Xi is a properly defined 
random variable. For notational simplicity we define the operator 51^: suppose X = h{ej, e^-i, . . .) is a random 
variable which is a function of the innovations e;, I < j, then D,k{X) :— h(ej, . . . , efc+i, ej,, ek-i, ■ ■ where 
(efc)feez is an iid copy of {ek)kez- Namely in X is replaced by e^.. 

For a random variable X and p > 0, we write X <E if \\X\\p :— (E\X\Py^P < 00, and in particular, use 
for the £^-norm ||^|l2- Assume Xi g C^, p > 1. Define the physical dependence measure of order p as 

6pii) ^ \\X, ^ na{X,)\\p, (10) 

which quantifies the dependence of Xi on the innovation eq- Out main results depend on the decay rate of 
Sp{i) as i ^ 00. Let p' = min(2,p) and define 



i/p' 



i—n \i—n / 

00 

^p(") = ^'^^'^{Cp'i'p{n),Sp{i)}, 



and 



(11) 



i=0 



where Cp is defined in (30). It is easily seen that 5'p(-) < 6p(-) < Ap(-). We use Qp, ^'p and Ap as shorthands 
for 8p(0), 5'p(0) and Ap(0) respectively. We make the convention that dp{k) = for fc < 0. 

There are several reasons that we use the framework ^ and the dependence measure (10). First, the 
class of processes that ^ represents is huge and it includes linear processes, bilinear processes, Volterra 



processes, and many other time series models. See, for instance, Tong (1990) and Wiener (1958). Second, the 



physical dependence measure is easy to work with and it is directly related to the underlying data-generating 
mechanism. Third, it enables us to develop an asymptotic theory for complicated statistics of time series. 



2. 1 . Maximum deviations of sample autocovariances 



Note that 7^, is a biased estimate of 7^ with £7^. = (1 — |A:|/n)7;;. It is then more convenient to consider the 
centered version maxi<fc<<;^ \/n\jk ~ ^7k\ instead of maxKk<s„ \Ai|7fc ~ 7fc|- Recah ([?]) for a„ and 6„. 

Theorem 1. Assume EXj = 0, e for some p > 4, and 8p(m) = 0{m~°'), Ap(m) — 0{m^°' ) for 
some a > a' > 0. If Sn satisfies s„ — > 00 and Sn = 0{n^) with 



< ?7 < 1, rj<ap/2, and 77 min{2(p — 2 — ap), (1 — 2a')p} < p — 4, 



i/ien /or all x £ 



lim P 

n—^oo 



( max |v/n[7fc - (1 - /s/n)7fc]| < ^a^(a2s„ a; + 62s„) ) 

\ l<fc<s„ / 



exp{— exp(— x)}. 



(12) 



(13) 



In ( 12 ), if p < 2 + or 1 < 2a', then the second and third conditions arc automaticahy satisfied, and 
hence Theorem [l] allows a very wide range of lags s„ — 0{n^) with < 77 < 1. In this sense Theorem [l] is 
nearly optimal. 

For the maximmii deviation maxi<fe<„ [7^ — £7^! over the whole range 1 < fc < n, it seems not possible to 
derive a limiting distribution by using our method. However, we can obtain a sharp bound (n^^ log n)^/^. The 
upper bound is given in (15), while the lower bounded can be obtained by applying Theorem [l] and choosing 
a sufficiently small 77 such that (12 1 holds. Using Theorem [2| [Xiao and Wu (20101 derived convergence rates 
for the thresholded autocovariance matrix estimates. 

Theorem 2. Assume EX, ^ 0, X, e CP for some p > ^, and Qp{m) = 0{m-°'), Ap(m) = 0{m-°'') for 
some a > a' > 0. If 



a > 1/2 or a'p > 2 



then for Cp = 6(p + 4) e^^^ K4 64, 



lim P max \% - E7fe| < Cp 

TH-oo \ l<k<n 



logn 



= 1. 



(14) 



(15) 



Since Op(r?T,) > ^p(7n), we can assume a > a'. For a detailed discussion on their relationship, see Remark 6 



of Xiao and Wu (2010). It turns out that for the special case of linear processes the condition (12 1 can be 



weakened to the following one: 

< 77 < 1, 77 < ap/2, and (1 - 2a)7; < (p - 4)/p. 



(16) 



See Remark [2j Furthermore, for linear processes the condition (14 1 can be relaxed to ap > 2 as well. 

In practice, the mean fi = EXq is often unknown and we can estimate it by the sample mean X„ 
(1/71) X]r=i -^i- '^^^ usual estimates of autocovariances and autocorrelations are 



1 " 

7fc = - {Xi-k - Xn){Xi ~ Xn) and = 7^/70. 
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(17) 



i=k + l 



Corollary 3. Theorem^ and Theorem^ still hold if we replace jk therein by 7fc. Furthermore, 

lim P{ max \y^n[rk - - k/n)rk]\ < (\/a^/7o)(a2s„ a; + 62s„) ) = exp{- exp(-a::)}. 
Proof of Corollary For the 7^ version of Theorem [T] it suffices to show that 

max |Vn(7fc -7fc)| = op ( ^=^= 

l<k<s„ V VlogS,; 



(18) 



Let Sk — By Theorem 1 (iii) of Wu (2007), we have ||maxi<fc<„ 15*4:111 < 2-^7162. Since 

n n n—k k 

{Xi-k — Xn){Xi — Xn) — ^ Xi^kXi = —Xn ^ Xi + Xn ^ Xi — kX^, 
i—k+1 i—k+1 i=l 2—1 

we have (18 1. The proof of the 7^ version of Theorem [2] is similar. The assertion on sample autocorrelations 
can be proved easily, and details are omitted. □ 



2.2. Box- Pierce tests 



Box-Pierce tests (Box and Pierce 1970 Ljung and Box, 1978) are commonly used in detecting lack of fit 



of a particular time series model. After a correct model has been fitted to a set of observations, one would 
expect the residuals to be close to a sequence of iid random variables, and therefore one should perform some 
tests for serial correlations as model diagnostics. Suppose (v'Ci)i<i<„ is an iid sequence, let fk be its sample 
autocorrelations. Then the distribution of Qn{K) '■— "-X^feLi^/c approximately Xk- Logically, it is not 
sufficient to consider a fixed number of correlations as the number of observations increases, because there 
may be some dependencies at large lags. We present a normal theory about the Box-Pierce test statistic, 
which allows the number of correlations included in Qn to go to infinity. 

Theorem 4. Assume Xi e , EXi — and X^fcLo ^^"^sCfc) < 00. // s„ — > 00 and s„ = 0{n^) for some 
(3 < 1, then 



1 



[n{% - (1 - k/n)^kf - (1 - k/n)a^] ^ 0, 2 ^ 



k=l \ keZ / 

To see the connection to the Box-Pierce test, we have the following corollary on autocorrelations. Using 
the same argument, we can show that the same asymptotic law holds for the similar Ljung-Box test statistic 
QLB = n{n + 2)j:k=ifj</in-k). 

Corollary 5. Under the conditions of Theorem^ the same result holds ifjk is replaced by 7^.. Furthermore, 



k=i \ ^0 kez 



(19) 
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Remark 1. Theorem [4] clarifies an important historical issue in testing of correlations. If 7^ = for all 
A: > 1, which means Xi are uncorrelated; then (Jq = 7q and ct^ = for all |/c| > 1, and (19) becomes 



^f;K,-(l-fc/n)] ^AA(0,2). 



(20) 



fe=i 



In an influential paper, Romano and Thombs ( 1996 1 argued that, for fixed K, the chi-squared approximation 
for Qn{K) does not hold if Xi are only uncorrelated but not independent. One of the main reasons is that 
for fixed fi, . . . , fii are not asymptotically independent if Xi are not independent. However, interestingly, 
the situation is different if the number of correlations included in Qn can increase to infinity. According to 
([5|, V^7fc,i \/^lkn+h are asymptotically independent if /i > and fc„ 00, because the asymptotic 
covariance is ct^ = 0. Therefore, the original Box-Pierce approximation of Q„(s„) by , with unbounded 

> TV (0,2) as Sn 00. This 



Sn, is still asymptotically valid in the sense of (20) since {xi 



observation again suggests that the asymptotic behaviors for bounded and unbounded lags are different. A 



similar observation has been made in Shao (2011), whose result also suggests that (20) is true under the 
assumption that Ss{k) — 0{p^) for some < p < 1. Our condition YlV=i k^^sik) < 00 is much weaker. 

The next theorem consists of two separate but closely related parts, one is on the estimation of (Tq = 
X^fcezTfc' ^"^^ other is related to the power of the Box-Pierce test. Define the projection operator 



V'- = E{-\Ti^) - E(-I-Floo ), where J"/ = (e„ e,+i, . . . , e,), i,j € Z. 
Theorem 6. Assume Xi € EXi = and Q4 < 00. //s„ — >■ c» and s„ = o{y/n), then 



(21) 



k — — Sr, 



where D'^ = J^^o T^^i^r^) ^« = 7o^. + 2 EZi lkX,-k- Furth- 



ermore, i 



ifEk=ill>0, then 



(22) 



\fe=l k=l / 
where Da = EZo ^"(^^^0 ^^-^ = J^Zi IkX^-k- 

Corollary 7. Under conditions of Theorem^ the same results hold if is replaced by 74,. Furthermore, 
there exist positive numbers and r| such that 



Vn(Y.fl-Y,rl]^U{Q.rl) and 



E ^l- E 'l] ^-^(0, 



\k=l 



fc=l 



\k— — s 



As an immediate application, we consider testing whether {Xi) is an uncorrelated sequence. According to 

s„(2n - s„ - 1) 



(20), we can use the test statistic 



i{Sn) 



2n 



whose asymptotic distribution under the null hypothesis is A/'(0, 2). The null is rejected when T„ > v^^i-q, 
where zi-q is the (1 — a)-th quantile of a standard normal random variable Z . However, under the alternative 
hypothesis X]fc°=i ^fc t'^^ distribution of T„ should be approximated according to Corollary [Tj and the 
asymptotic power is 

which increases to 1 as n goes to infinity. 
3. A Simulation Study 

Suppose {j'k'^^ ^ sequence of autocorrelations, one might be interested in the hypothesis test that = r^"-* 
for all fc > 1. This hypothesis is, however, impossible to test in practice, except in some special parametric 
cases. A more tractable hypothesis is 

Ho : rk = ri°^ for 1 < A: < s„. (23) 

In traditional asymptotic theory, one often assumes that s„ is a fixed constant, for example, the popular 
Box-Pierce test for serial correlation. Our results in the previous section provide both £°° and £^ based 
tests, which allow s„ to grow as n increases. Nonetheless, the asymptotic tests can perform poorly when 
the sample size n is not large enough, namely, there may exist noticeable differences between the true and 
nominal probabilities of rejecting Hq (hereafter referred as error in rejection probability or ERP). In a recent 
paper, Horowitz et al. (20061 showed that the Box-Pierce test with bootstrap-based p- values can significantly 



reduce the ERP. They used the blocks of blocks bootstrapping with overlapping blocks (hereafter referred as 



BOB) invented by Kiinsch ( 1989 1. For finite sample, our based test is similar as the traditional Box-Pierce 
test considered in their paper, so in this section our focus will be on the C°° based tests. We shall provide 
simulation evidence showing that the BOB works reasonably well. 

Throughout this section, we let the innovations e.i be iid standard normal random variables, and consider 
the following four models. 

I.I.D.: X, = (24) 

AR(1): A, = 6A,_i + (25) 

Bilinear: A^ = (a + feei)Aj_i + u (26) 



ARCH: A, = y'a + hX'l_^ ■ e,. (27) 

We generate each process with length n = 2 x 10^, and compute 

a^/ ( max |rfe - (1 - fc/n)rfc| /a/S^ - 62s„ ) (28) 
9 



with Sn = 5 X 10^ and (Tq = X]l=-t '^fcj where t„ is chosen as i„ — [n^/^J = 271. Based on 1000 repetitions, 
we plot the empirical distribution lunctions in Figure [T] We see that these four empirical curves are close to 
the one for the Gumbel distribution, which confirms our theoretical results. 

n = 2e+07 lags = 5e+05 




Fig 1. Empirical distribution functions for quantities in {ggp . We choose b = 0.5 for model j'25p , a = b = 0.4 for model ^20j 
and a = b = 0.25 for model (27]). The black line gives the true distribution function of the Gumbel distribution. 



One the other hand, these empirical distributions are not very close to the limiting one if the sample size 



is not large, because the Gumbel type of convergence in ( 13 ) is slow. This is a well-known phenomenon; see 



for example Hall ( 1979 1 . It is therefore not reasonable to use the limiting distribution to approximate the 



finite sample distributions. To perform the test (23), we repeat the BOB procedure as described in Horowitz 



et al. (2006) (called SBOB in their paper). Since in the bootstrapped tests, the test statistics are not to be 



compared with the Hmiting distribution, we can ignore the norming constants in ( 28 ) and simply use the 
following test statistics 



max 

l<fe<s„ 



rfc - (1 - fc/n)r 



and Mn 



where Mn is the self-normalized version with ctq estimated as cto = X]fc=_t ^fe; with t„ = min{[n^^'^J , s„}. 

For simplicity, we refer these two tests as M-test and 7W-test, respectively. 

From the series Xi , . . . , Xn , for some specified number of lags s„ that will be included in the test and 

block size b„, form Yi = {Xi, X^+i, . . . , X^+s^)^, 1 < z < n — s„ and blocks Bj = (Yj , Yj+i, . . . , Yj+b„-i)i 

1 < j n — Sn — -\- 1. For simplicity assume /i„ = ?i/b„ is an integer. Suppose Yjj is obtained by sampling 
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a block from the set of blocks {Bi, B2, • • • , Bn-s„-bn+i}j a-nd then sampling a column from B^, let Covj 
represent the covariance of the bootstrap distribution of Yj, conditional on {Xi,X2, ■ ■ ■ ,Xn). Denote by 
the j-th entry of Yj , set 



y^Covj(y/,yji).Cov5(r/+\ 



Y, 



k+l\ 



(e) 

The explicit formula of r , was also given in 



Horowitz et al. 



(2006). The BOB algorithm is as follows. 



1. Sample /i„ times with replacement from {Si, S2, . . . , Bn~s„-b„+i} to obtain blocks {Bl,B2, 
which are laid end-to-end to form a series of vectors ( Yj* ,Y2 , . . . ,Y*). 

2. Pretend that (Yi ,Y2 , ■ ■ ■ ,Y*) is a random sample of size n from some s„-dimensional population 
distribution, let r'^ be the sample correlation of the first entry and the {k + l)-th entry. Then calculate 



the test statistic M* = maxi<fc<s 



and Ml = M*/y^, where = El=-i„ (rlf ■ 
3. Repeat steps 1 and 2 for N times. The bootstrap p-value of the M-test is given by #(M* > Mn)/N . 
For a nominal level a, we reject Hp if > Mn)/N < a. The Al-test is performed in the same 

manner. 

We compare the BOB tests and the asymptotic tests for the four models listed at the beginning of this 
section, with a = .4 for (25), a ~ b = A for (26) and a = 6 = .25 for (27). We set the series length as 
n = 1800, and consider four choices of s„: Llog("^)J = 7, [rt^/^J = 12, [^J = 42 and 25. The BOB tests 
are performed with N ~ 999, and the asymptotic tests are carried out by comparing {y/nA4n — &2s„) 
with the corresponding quantiles of the Gumbel distribution. The empirical rejection probabilities based on 
10,000 repetitions are reported in Table [T] All probabilities are given in percentages. For all cases, we see 
that the asymptotic tests are too conservative, and the ERP are quite large. At the nominal level 1%, the 
rejection probabilities are often less than or around 0.1%, and at most 0.51%; while at nominal level 10%, 
they are often less than 3% and at most 6.4%. Except for the bilinear models with s„ = 7 and s„ = 12, the 
bootstrapped tests significantly reduce the ERP, which are often less than 0.2% at nominal level 1%, less 
than .5% at level 5%, and less than 1% at level 10%. The performance of A/-test and A^-test are similar, 
with the former being slightly more conservative. The BOB tests are roughly insensitive to the block size. 



which provides additional evidence of the findings on BOB tests in Davison and Hinkley (1997). 

The bootstrapped tests still perform relatively poorly for bilinear models when s„ is small (7 and 12). 



This is possibly due to the heavy-tailedness of the bilinear process. Tong ( 1981 ) gave necessary conditions 



for the existence of even order moments. On the other hand, Horowitz et al. (2006) showed that the iterated 
bootstrapping further reduce the ERP. It is of interest to see whether the iterated procedure has the same 
effect for the C°° based tests, in particular, whether it makes the ERP reasonably small for the bilinear 
models when s„ is small. The simulation for the iterated bootstrapping will be computationally expensive 
and we do not pursue it here. 
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Table 1 

Empirical rejection probabilities (in percentages) 



Test 






7 






12 




Sfi — 


25 






42 




1 


5 


10 


1 


5 


10 


1 


5 


10 


1 


5 


10 


I.I.D. 


.00 


.34 


1.6 


.02 


.69 


2.3 


.03 


.93 


3.2 


.04 


1.0 


3.3 


bn = 5 


1.3 


5.1 


10.0 


1.1 


5.2 


9.8 


.95 


4.7 


9.3 


1.0 


4.7 


9.6 




1.4 


5.3 


10.4 


1.2 


5.6 


10.5 


1.1 


5.1 


10.1 


1.1 


5.1 


10.2 


b„ = 10 


.83 


4.8 


10.0 


1.1 


4.9 


9.6 


1.1 


4.9 


10.1 


.65 


4.3 


8.9 




.94 


5.1 


10.3 


1.2 


5.4 


10.3 


1.1 


5.5 


11.0 


.78 


4.7 


9.6 


AR(1) 


.01 


.17 


1.2 


.01 


.36 


1.8 


.02 


.77 


2.5 


.02 


.88 


2.8 


b„ = 10 


1.3 


5.7 


10.9 


1.3 


5.5 


11.4 


1.3 


5.5 


10.9 


1.1 


5.7 


11.2 




1.3 


5.7 


11.2 


1.4 


5.9 


11.7 


1.3 


6.0 


11.5 


1.2 


6.0 


11.7 


b„ = 20 


.98 


5.5 


10.9 


1.0 


5.8 


11.3 


1.1 


5.3 


10.6 


.86 


4.9 


10.5 




1.0 


5.7 


11.0 


1.1 


6.1 


11.9 


1.2 


5.6 


11.0 


.83 


5.0 


10.9 


Bilinear 


.34 


2.8 


6.4 


.43 


2.5 


5.8 


.51 


2.5 


5.9 


.40 


2.8 


5.9 


b„ = 10 


2.8 


8.7 


14.4 


1.8 


7.1 


12.7 


1.2 


6.1 


12.0 


1.2 


5.4 


10.9 




2.7 


8.6 


14.5 


1.8 


7.3 


12.9 


1.3 


6.2 


12.2 


1.1 


5.5 


11.1 


bn = 20 


2.7 


8.4 


14.6 


2.1 


7.2 


13.5 


1.5 


6.3 


12.0 


1.3 


5.2 


10.8 




2.5 


8.3 


14.6 


2.1 


7.5 


13.9 


1.5 


6.2 


12.0 


1.2 


5.3 


10.9 


ARCH 


.05 


.82 


3.2 


.06 


1.5 


3.9 


.09 


1.3 


4.0 


.12 


1.4 


4.4 


b„ = 10 


.99 


5.0 


10.5 


1.2 


4.9 


9.7 


.80 


4.6 


9.9 


.82 


4.7 


9.3 




1.1 


5.4 


10.9 


1.4 


5.3 


10.4 


.92 


5.1 


10.7 


.94 


5.1 


10.2 


b„ = 20 


.86 


5.1 


10.5 


1.0 


5.0 


10.3 


.69 


4.8 


9.7 


.63 


4.3 


8.9 




.98 


5.5 


11.0 


1.2 


5.6 


11.0 


.89 


5.1 


10.4 


.76 


4.7 


9.5 



The values 1, 5, 10 in the 2nd row indicate nominal levels in percentages. The numbers in the third 
row starting with the model name "I.I.D." are for the asymptotic tests. The fourth row staring 
with bn = 5 is for BOB M-tests with block size 5. The fifth row is for BOB jM-tests with the same 
block size 5. Other rows should bo read similarly. 



4. Proofs 

This section provides proofs for the results in Section [2] For readabihty we hst the notation here. For a 
random variable X, write that X e CP, p > 0, if \\X\\p := {E\X\Py^P < oo. Write ||X|| = ||X||2 if p = 2. 
To express centering of random variables concisely, we define the operator Eq as EqX X — EX. For a 
vector X = (xi, . . . , x^)^ G let \x\ be the usual Euclidean norm, |a;|oo ■— maxi<i<d |a;i|, and \x\, := 
TohiKiKd \xi\. For a square matrix A, p{A) denotes the operator norm defined by p{A) max|^|=i \Ax\. Let 
us make some convention on the constants. We use C, c and C for constants. The notation Cp is reserved for 



the constant appearing in Burkholder's inequality, see (30). The values of C may vary from place to place, 
while the value of c is fixed within the statement and the proof of a theorem (or lemma) . A constant with a 
symbolic subscript is used to emphasize the dependence of the value on the subscript. 
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The framework (|9]) is particularly suited for two classical tools for dealing with dependent sequences, 
martingale approximation and m-dependence approximation. For i < j, define J-"/ = (e.;, Ci+i, . . . , e^) be the 
(T-field generated by the innovations e^, e^+i, . . . , e^, and the projection operator "H^ (•) = E(-|J-"^). Set J-i := 
:= J^'Looj s-nd define Hi and W similarly. Define the projection operator {■) = W{-) — W^^{ ), 
and Vi{-) — 'Hi{-) — 'Hi+i{-), then {V^{-))jei and {V-i{-))iez become martingale difference sequences with 
respect to the filtrations {J-'-') and (J^^i), respectively. For m > 0, define Xi = Hi-mXi, then {Xi)i^z, is a 
(m + l)-dependent sequence. 



4-1. Some Useful Inequalities 



We collect in Proposition [8] some useful facts about physical dependence measures and martingale and in- 
dependence approximations. We expect that it will be useful in other asymptotic problems that involve 
sample covariances. Hence for convenience of other researchers, we provide explicit upper bounds. 



We now introduce a moment inequality ( 29 1 which follows from the Burkholder inequality (see Burkholder 



1988). Let (Di) he a martingale difference sequence and for every i, Di ^ C^, p> 1, then 
\\Di+D2 + - 



■+D,r^ <q' (^\\D,r; + \\D,\\'^ + ■■■ + 

where p' — min{p, 2}, and the constant 

Cp = (p - if 1< p < 2 and = \/p-l ff p > 2. 



(29) 



(30) 



We note that when p > 2, the constant Cp in (291 equaled to p — 1 in Burkholder ( 1988 1 , and it was improved 



to by Rio (2009) 



Proposition 8. 1. Assume KXi = and p > 1. Recall that p' = min(p, 2). 

\\V''X,\\p<Sp{i) and H^^Jp < <5p(z) 

Kp '■= ll-'^^ollp < Cp'^p 



< CpAnQp, where = ( ^ \cif 



i/p' 



\lk\ < C2(fc), where (p{k) := ^6p{j)Sp{j + k) 



< 2Cpf2KpQp\/n, when p> A 



p/2 



< 4Cp/2CpQpBny/n, when p > i 



(31) 
(32) 

(33) 
(34) 
(35) 

(36) 



p/2 



where ^ max{maxi<j<„ J^l^i <^i.j^ maxi<j<„ J27=i ^L} 



j=i -ij' j-iji" ^1=1 
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2. For m > 0, define Xi = T-Li^mXi. For p > 1, let Sp{-) be the physical dependence measures for the 
sequence (Xi). Then 



5p{i) < Sp{i) 

\\XQ-Xo\\p<Cp'^p{m + l) 



< CpAr^Opim + 1) 



[x.^kX., - 7fc - X^-kX^ + % 

i=fc+l 



< 4Cp(n - fc)i/p'K2pA2p(m + 1). 



(37) 
(38) 

(39) 
(40) 



Proof. The inequalities (31 1 and (37) are obtained by the first principle. Since Xi^j. = J2jez'^''-^i~k and 



Y,E[iV-^Xo){V-^Xk)] 

j=-k 



< S2{j)S2U + k) < Ck, 



which proves (34). For (36), it can be similarly proved as Proposition 1 of Liu and Wu (2010), and (39) was 



given by Lemma 1 of the same paper. (33) is a special case of (39). Define Yi = Xi-kXi, then (Yi) is also a 



stationary process of the form (l9|). By Holder's inequality, H^i — i^o(i^i)||p/2 < '2Kp[6p{i) + Sp{i — k)]. Applying 



(33) to (Yi), we obtain (35). To see (38), we first write X^n — Xj^ — J^^i'P-jXm- Since ||'P-j-'^m||p < 



5p{m + j), and {'P-jXm)j>i is a martingale difference sequence, by (29), we have 



1^0 - XoW'p < < E W^-^Xrufp < < J2^Sp{m+j)]P' = q'[%im + 1)]"' . 



The above argument also leads to ( 32 ) . Using a similar argument as in the proof of Theorem 2 of Wu ( 2009 ) , 



we can show (40). Details are omitted. 



□ 



4.2. Proof of Theorem\l\ 

The proof is quite complicated and will be divided into several steps. We first give the outline. 



4-. 1.0. Outline 

Step 1: m-dependence approximation. Define Rn^k = 'Y^^i=k+iiXi-kXi — 7fc). Set m„ = [n'^J, < /? < 1. 
Define Xi — Hi-mnXi, % ~ E{XoXk), and Rn,k — J2^i=k+i(Xi~kXi — 7*;). We next show that it suffices to 
consider Rn,k- 

Lemma 9. Assume EXi = 0, Xi e £p, and Qp{m) = 0{m^°') for some p > 4 and a > 0. If s„ — 0{nV) 
with < rj < ap/2, then there exists a j3 such that rj < p < I and 



max 

l<fe<s„ 



Rn,k Rn.k 



= Op 



\J nj log 
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Step 2: Throw out small blocks. Let Z„ = ["-^Ji where 7 € (/3,1)- For each t„ < fc < s„, we apply the 
blocking technique and split the integer interval [fc + 1, n] into alternating large and small blocks 



(41) 



Ki = [fc + l,s„] 

Hj = [sn + [j - l)(2m„ + /„) + 1, s„ + [j - l)(2m„ + /„) + /„]; l<j< w„ - 1, 
K.j+1 = [sn + (j - l)(2m„ + + In + 1, s„ + j(2m„ + ;„)]; 1 < j < w„ - 1; and 
Hw^ = [s„ + {wn - l)(2m„ + Z„) + 1, 71], 

where w„ is the largest integer such that s„ + {wn — l)(2m„ + In) + ln < n. Denote by \H\ the size of a block 
H. By definition we know /„ < |i?u;,J < 3Z„ when 71 is large enough. For I < j < Wn define 



E 



Xi_feXi - 7fe ) and [/^.j = ^ (^Xi^^Xi - 7^ 



Note that Wn ~ n-/(2m„ + /„) ~ n^"''. We show that the sums over small blocks are negligible. 
Lemma 10. Assume the conditions of Theorem^ Then 



max 

l<fc<s„ 



Op 



log Sn , 

Step 3: Truncate sums over large blocks. We show that it suffices to consider 
Lemma 11. Assume the conditions of Theorem^ Then 



max 

l<fc<s„ 



Op 



log 



Step 4- Compare covariance structures. In order to prove Lemma 14 we need the autocovariance structure 
of {Tin,k/\/n) to be close to that of (Gfe). However, this only happens when fc is large. We show that there 
exists an < t < 1 such that for i„ — 3[sJ:J, (i) maxi<fe<(^ \T^n,k/\/n\ does not contribute to the asymptotic 
distribution; and (ii) the autocovariance structure of {TZn,k/ \/n) converges to that of (Gfc) uniformly on 

tyi <^ fc ^ Sji. 

Lemma 12. Under conditions of Theorem^ there exists a constant < i < 1 such that for tn = 3[Sn\ , 



lim P ( max \TZn,k\ > \/ o-q"- log s„ I = 0. 



i<fe<t 



(42) 



Lemma 13. Let conditions of Theorem^ be satisfied. Recall that tn = 3[s5:J from Lemma 12 There exist 
constants Cp > and < ^ < 1 such that for any tn < k < k + h < Sn, 

I Cov {TZn, k, Tin, k+h)/n - ah\ < C^Sn^. 
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Step 5: Moderate deviations. Let i„ = 3[s'J be as in Lemma 12 For i„ < /ci < ^2 < . . . < fc^ < s„, 
define TZn = (7^.„,fcl,7^„,fc2, . . . ,7^„,fe^)^ and V = (Gfe, , Gfe^ , . . . , Gfe^)^, where {Gk) is defined in ([e]). Let 
E„ = Cov(7?.„) and S = Cov(V). For fixed x e M, set z„ = o-2s„^ + ^2s„; wliere tfie constants a„ and 6„ are 
defined in ([7|. In tire following lemma we provide a moderate deviation result for TZn- 

Lemma 14. Assume conditions of Theorem [7] Then there exists a constant Cp^d > 1 such that for all 
t„ < fci < fc2 < ■ • • < fed < s„, 



\P{\nn/Vnl > Zn)-P{\V\, > 2„)| < 



Pi\V\,>Zn 

(l0gS„)l/2 



Gp,d exp < - 



(log Sn)^ 



4-2.1. Step 1: m-dependence approximation 

Proof of Lemma^ Recall that to„ = [n^ \ with rj < (3 < 1. We claim 

< 6Cp/20p6p(m„ - fc + 1) • ^/n. 



Pn,k Pn 

It follows that for any A > 

k Pji^k 



p/2 



(43) 



P I max 

, l<fe<s„ 



> Av/n/lo, 



nP/4AP/2 



i?, 



||P/2 
.,*;l!j,/2 



k=l 



< CpA~P/2s„(logs„)''/'*n-"^P/2 < C'pA-P/2n''-"''P/2(logn)P/^ 



Therefore, if ap/2 > rj, then there exists a /3 such that rj < f3 < 1 and r] — a(5p/2 < 0, and hence the preceding 
probability goes to zero as n — ^ oo. The proof of Lemma |9] is complete. 



We now prove claim (|43p. For each 1 < fc < s„, we have 

n 



\Rn,k — Rn,k\\p/2 < 



i=k+l 



p/2 



i=k+l 



p/2 



Eq {Xi^k — 'Hi-m„Xi^k){Xi — Xi) 



i=k+l 



p/2 



Observe that {XiVi-k-jXi^k)i<i<n is a backward martingale difference sequence with respect to J-i-k-j if 



j > rUn , so by the inequality ( 29 1 , 



E i-^i-k — Xi^k)Xi 
i=k+l 



E XiVi-k-jXi^k 
i=k+l 



oo 

^ E 

oo 

- E V^<^p/2||-^J + fe'Po^j|lp/2 
j—m-\-l 



p/2 



< Cp/20p6p(m„ + 1) 



Similarly we have || Yl7=k+ii'^i-rn„Xi^k){Xi - ^i)llp/2 < \/^Cp/26p6p(m„ + 1). Similarly as (39), we get 
- Hi-m„Xi^k\\p < G>p{mn - fc + 1). Let Yn,i ■■= {Xi^k - l-Li-r,i„Xi^k){Xi - Xi). Then 

\\Yn,^ - VLo{YnMp/2 ^ 2 [5p{i)Qp{mn - fc + 1) + 5p{i - fc)ep(m„ + 1)] . 
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Therefore, by (33), it follows that 



i=/c+l 



< 4Cp/26p6p(m„ - k + I) ■ y/n, 



p/2 



and the proof of ( 43 1 is complete. 



□ 



4-2.2. Step 2: Throw out small blocks 

In this section, as well as many other places in this article, we often need to split an integer interval 
[s,t] ~ {s, s + 1, . . . ,t} C N into consecutive blocks Si, . . . with the size m. Since s — t + 1 may 
not be a multiple of to, we make the convention that unless the size of the last block is specified clearly, it 
has the size to < \Bw\ < '2m, and all the other ones have the same size to. 

Proof of LemmaUR It suffices to show that for any A > 0, 



lim Vp 



> A 



log s„ 



= 0. 



Observe that Vkj, 1 < J < Wn, are independent. By (35l, ||Vfej|| < 2\Kj\^^^K4Q4. By Corollary 1.6 of Nagaev 



(1979), for any M > 1, there exists a constant Cm > 1 such that 



P 



3 = i 



> A 



log.s„ / J^l ^ ^ V C'jv^ ^2„/logs„ 



Cm 



(44) 



where we resolve the constant A into the constant Cj\/ in the last inequality. It remains to show that 



1™ EE-^d^/^jl - li^^n) = 0, where (/)„ 

k=lj = l 



logn 



(45) 



holds for any (5 > 0, where qi is the smallest integer such that P"^^ < min{(p — 4)/p, (p — 2 — 2r})/{p — 2)}. 
This choice of qi will be explained later. We adopt the technique of successive TO-dependence approximations 



from Liu and Wu (20101 to prove (45). 



For g > 1, set to„_, = [n'^'J. Define X^^q = U^-rn^^^Xi, jk,q = K{Xo,qXk,q), and 

^k,j,q = E {^i-k,qXi^q ~ Jk.q)- 

In particular, m„^i is same as to„ defined in Step 2, and Vk^^i ~ Vkj. Without loss of generality assume 

Sn < L"-''J • Let go be such that /3''°+^ < rj < We first consider the difference between Vk.j^q and Vkj.q+i 
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for 1 < q < qo- Split the block Kj into consecutive small blocks Bi, . . . , Bw„ ^ with size 2m„.q. Define 



(0) _ 



iXi-k,qXi,q - Jk.q) and V^j^gj. — ^ {X^-k,q+lXi^q+l - 7fe,g+l). 



^kj,q,t ~ [-^^-k,q^i.,q " Ik,. 



(46) 



Observe that VllJ^ ^^ and V^j- are independent if |ti — ^2! > 1- Similar as HAl, for any M > 1, there 
exists a constant Cm > 1 such that, for sufficiently large n, 



Pi\Vk,j,q-Vk,j,q + l\ > 5(bn)^P 



k,j,q,t k,j,q,t 



> 



< 



(0) 



(1) 



j,g,t k,j,q,t 



> C 



(47) 



-M 



Similarly as ( 43 ) , we have 



(0) 



k,j,q,t k,j,q,t 



(1) 



p/2 



< Cp|i3t|i/2^-^^i. It follows that 



k=i]=i 



f 'Y P/4 -ap/2 



rrin.qin/ log n)P/'^ J 



0. 



Under the condition (16), there exists a < /? < 1, such that 

fc=l j=l 

Recall that qi is the smallest integer such that /3''i < min{(p — 4)/p, (p — 2 — 2rj)/{p — 2)}. We now 
consider the difference between Vk,j,q and Vk.j,q+i for qo < q < qi- The problem is more complicated than 
the preceding case 1 < q < qQ, since now it is possible that m„ ^ < k for some 1 < fc < Sn- We consider three 
cases. 

Case 1: k > 2to„ ,j. Partition the block Kj into consecutive smaller blocks Si, . . . ,i3to„ , with same size 

is a martingale difference 



t is odd 



m„^,. Define t/.^J^ , and , as in g. Observe that (y.^J^ , - F^,, 

sequence with respective to the filtration {S,t (e; : ' < max{St}))j ^^^j, and so is the sequence and fil- 
tration labelled by even t. Set Co = (e; ^ ^ < miii{Si}) and — {ei : I < min{Si} — Wn.q)- For each 
1 < t < w„_q, define 

2 



il,i2eBf 



for ^ = 0, 1. By Lemma 1 of Haeusler (|1984|), for any M > 1, there exists a constant Cm > 1 such that 

.(1) 



P{\Vk.j,q - Vk.j,q+l\ > 50„) < ^ P 

+ e4^ 



k,j,q,t k,j,q,t\ 



> 



/=0,1 



E H^^' > 



(0 ^ C'a/" 



(logn)' 



is odd 



(logn)2 
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P 



.t is even 



(0 > C'a/" 



(logn)2 



(48) 



By M, EfeGzl7fe,,+/|' < Ql and hence by M, \\V^'^\\p/2 < Cpm^Cq. Observe that and V^^ are 



1/2 



;(0) 



;(0) 



independent if |ti — > 1, so similarly as (|44|), we have 
P 



j2 vi"> ^^^'"^ 



.£ is odd 



(logn)2 



The same inequality holds for the sum over even t. For the first term in p8|), we claim that 

.(1) 



(0) 



'k,j,q,t ^k,j,q,t 

which together with the preceding two inequalities implies that 

P m,j.q - Vk,j^,+,\ > <50„) < Cp,M wn^, ■ n-P/2(logn)3p/2 . ^-^^^ + ^P/ 



(49) 



-M 



It follows that under condition (16), there exists a < /? < 1 such that 



(50) 



< „l+';-M + c^^^^ . „l+';-P/2(logn)3p/2 J„;3'(p/2-l-a/3p) ^ ^/3'(p/4-l)j ^ 

Case ^; fc < TO„g+i/2. Partition the block Kj into consecutive smaller blocks with size 

Smn,q- Define V^^^ ^ and V^^^'' ^ as in (46|. Similarly as (43), we have 



(0) 



k,j,q,t k,j,q,t 



(1) 



p/2 



Similar as (|47|, for any M > 1, there exist a constant Cm > 1 such that 

.(0) 



(1) 



-M 



< 



n-^^ + Cp.M ■ w^.,q ■ n-P/4(logn)f/4 • mj^/^ . m-^f/^. 



It follows that that under condition (16), there exists a < /3 < 1 such that 

E E^(i^'=.j-^9-^'=.j.«+ii^'5'^") 
fc=i i=i 

p/4-Q0p/2 



(51) 



< ni+"-^^ + Cp,M • ni-P/4(logn)f/4 • (n^') = o(l) 



Case 3: mn^q+i/'i, < k < 2m„^q. We use the same argument as in Case 2. But this time we claim that 



(0) 



k,j,q,t k,j,q,t 



(1) 



p/2 



<Cr, 



^n^q ■ m-n"q+l + mn^qCp{k) 



(52) 
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where Cp(fc) is defined in Since J2T=m[Cpik)]P^^ < [J2T=m Cp(fc)]^^^ = 0{m-"P/^), under condition 
there exist constants Cp,i\i > 1 and < (3 < 1 such that for M large enough 



2m„ g — 1 

E 

fc>m„.<j + i/2 j = l 



71, q 



2m„ g — 1 
fe>in„_,+i/2 

< n^+^-^' + CpM ■ ni-f/4(logn)P/4 • mP/^-i-^^^P/^ = o(l). 

.(0) 



(53) 



Alternatively, if we use the bound from (40), 



(1) 



p/2 



1/2 — ^ 

< CpTriri.q ■ "^„ g+i, it is still true that 



under condition (12 1, there exist constants Cp^M > 1 and < /3 < 1 such that for M large enough 

2m„,g — 1 M)„ 

E E^(I^M.^-^M,.+ ll>^'^n) 

< n^+V~M ^ (j^^^^ . „l-p/4(log„)p/4 . ^p/2-l-a'/3p/2 ^ 



(54) 



Combine (50), (51), (53) and (54 1, we have shown that 



lim V V P {\Vk,j.q - Vk,j.q+l\ > 5^n) - 0. 



k=ij=i 



for 1 < g < qi. Therefore, to prove (45), it suffices to show 



lim ££p(|yfc,,,,J>5<^„)=0 



(55) 



(56) 



fc=i j=i 

By considering two cases (i) 2m„^gj < fc < Sn and (ii) 1 < fc < 2mn,q^ under the condition < min{(p - 



4)/p, (p — 2 — 2ri)/(p ~ 2)}, and using similar arguments as those in proving (55), we can obtain (56). The 
proof of Lemma [To] is complete. 



We now turn to the proof of the two claims (|49|) and (|52|). For (|52|), we have 
(1) 



k,j,q,t 'k,j,q,t 



< 



p/2 



{Xi-k,q — Xi-k,q+l)Xi^q+l 



leBt 



p/2 



Eq [Xi_fc^q+l(Xi_g — Xi^q+l)] 



ieBt 



p/2 



Eo [{Xi^k,q — Xi-k,q+l){Xi^q — X^^g+l) 



^■.1 + 11 + III. 



p/2 



Similarly as in the proof of (43), we have 



I < Cp/20pQp{mn,q+i + 1) • y^3mn,q and /// < 4Cp/26j,6p(TO„,g+i + 1) • y^3m„ 
For the second term //, write 



Eo [Xi-fe,g+l(Xi,9 — Xi,g+l)] — ^ ^ Eo [(P,;-A..-iiXi_fe)('Pi_/2Xi)] 

/i=0 l2=in„,q+i + l 
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For a pair (^i, I2) such that i ~ k — li ^ i ~ I2, hy the inequaUty (29), we have 



< 



p/2 



For the pahs {li, I2) such that i — k — li — i — I2, hy the triangle inequahty 



J2 J2 ^om^k-ix,-k)iv,-k-iXi)] 

ieBt 1=0 



ni 71, <j + l 



<3mn,q-2 ^ Sp{l)dp{k + I) < 6mn^qCp{k). 



p/2 



1=0 



Putting these pieces together, the proof of (52) is complete. The key observation in proving (49) is that since 



k > 2TO„^g, Xi^k.q and Xi^q are independent, hence the product Xi^k.qXi^q has finite p-th moment. The rest 



of the proof is similar to that of ( 52 ) . Details are omitted. 



□ 



Remark 2. Condition (12) is only used to deal with Case 3, while (16) suffices for the rest of the proof. In 
fact, for linear processes, one can show that the term mn^qCp{k) in (|52[) can be removed, so we have (53 ) under 



condition ( [16^ and do not need (54). So (16) suffices for Theorem [T] Furthermore, for nonlinear processes 



with dp{k) = O [fc-(i/2+a)j^ ^i^g m„,,Cp(fc) 

can also be removed from i52l. Details are omitted 



4-2.3. Step 3: Truncate sums over large blocks 



Proof of Lemma 11 We need to show for any A > 



lim Vf 

71. — VCiT, ' ^ 



k=l 



> AW-^ I =0. 

log Sr. 



Using ( 35 ) , elementary calculation gives 



Uk,i — Uk, 



< 



E\U,J'^/^ ^ (2Cp/2A^pep)P/2|i?,f/4(l0gS„)3(P-4)/2 



(V^/log s„)P/ 



2-2 



7,(p-4)/4 



Similarly as (44), for any M > 1, there exists a constant Cj\/ > 1 such that 



P 



3 = 1 



> A 



log s„ 



log s„ 

\ Cm 12 



log s„ 



-M 



Therefore, it suffices to show that for any 5 > 0, 



fc=ii=i 



1™ EE^(|f^fe^^-'^^-^^i^'5 



logn 



Since we can use the same arguments as those for (45), Lemma 11 follows 
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(57) 



□ 



4-2.4- Step 4- Compare covariance structures 



Proof of Lemma\T^ Since \Uk,j\ < 2v^/(logs„) and EUl^ < EUlj < 4(k4Q4)^\Hj\, by Bernstein's in- 



equality (cf. Fact 2.3, Einmahl and Mason 1997), we have 



P {\Ti'n,k\ > a/ (Ton log Sn^ < CXp ■ 



((To"-logs„)/2 



4(K4e4)2n + n^cro/(logs„) 



Therefore, for any < l < a-o/[8(K484)^], (42) holds. 



□ 



Proof of Lemma\T3\ For 1 < j < w„, by (57), we have 



E{UkjUk+h,j) - nUk,jUk+h.j) < ||t/fej - (7fc,,|l|lC7fe+^j|l + \\UkJ\\Uk+hj - Uk+hJ 



^^ {2Cp/2^,epr/^\H,\P/^i\ogSn)'(p 



<^^iQ^\Hn ^(p_4)/8 
<Cp|i?j|n-(i-^)(P-4)/8(logn)3(f-4)/4. 



-4)/4 



Let Skj = Lie/Zj (-'^i-fc-'^j - 7fe), by (|35|) and (|43|), we have 



E{SkjSk+h,j) — E{UkjUk+h.j) < \\Skj — Uk.j\\\\Sk+h,j\\ + \\Uk.j\\\\Sk+h,j — Uk+hj\\ 

< AK4e4\Hj\^^'^ ■ 6e4e4(m„ -k + < C|77j|n-°^. 

Since 84(771) ~ 0{m^"), elementary calculation shows that A4(m) = 0{n^°' which together with 

Lemma [24| implies that if /c > i„, 

E(Uk,jUk+h,j)/\H^\ -<7h\< el (l6A4(t„/3 + 1) + 6647^;; + 4*4(tn/3 + 1)) 

Choose e such that < ^ < min{(l - 77)(p- 4)/8, a/?, ^^^/(l + (1 - i)7/2, 7- Then 
\CoYiTZ„,k,n„,k+h)/n-ah\ < Cp(n-(i-'')(f-4)/8(logn)(f-4)/4 + n-"'^ 



and the lemma follows. 



□ 



4-2-5- Step 5: Moderate deviations. 

Proof of Lemmaim Note that for x,y e M"*, \x + y\, < \x\, + \y\. Let Z 7V(0,/rf) and 0^ = (logs„)"^ 



Since \Uk,j\ < 2yri/(log s„)^, by Fact 2.2 of Einmahl and Mason (1997), 



P{\nn/Vn\, > Zn) < P{H!^Z\, > Zn-9n)+P{\Tln/V^-^lpZ\ > 9n] 
< P(|sy'Z|. > z^-9^) + Cp,d exp{-Cp-i(logs„)2} 
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By Lemma 23 the smallest eigenvalue of E is bounded from below by some Cd > uniformly on 1 < /si < 

we have p{Til/^ — T,^^'^) < ■ /9(E„ — E) < Cp,d s~^, where the first inequality 



k2 < ■ ■ ■ < kd- By Lemma 



13 



is taken from Problem 7.2.17 of Horn and Johnson (1990). It follows that by (74) and elementary calculations 
that 



> 



< P(|Ei/2Z|. > z„ - 29,,) + CpM exp {fi-'} 



By Lemma 22 we have 

>z„~2(?„) < [l + Cp,rf(logs„)"'/'] P(|Ei/2Z|. >z„). 
Putting these pieces together and observing that V and E^/^Z have the same distribution, we have 
P{\njV^\, > z„) < [l + Cp,d(logs„)-'/'] Pi\V\, > z„)+Cp,d exp{-C-i(logs„)2}, 



which together with a similar lower bound completes the proof of Lemma 14 



□ 



4-2.6. Proof of Theorem\^ 

After these preparation steps, we are now ready to prove Theorem [T] 
Proof of Theorem^ Set Zn = a2s„ x + 62s„- It suffices to show 

lim P max \TZk/V^\ ^ \/o\)Zn ] = exp{— exp(— x)}. 

ri— i-oo \ t„<fc<s„ / 



(58) 



Without loss of generality assume ctq — 1- Define the events Ak — {Gk > Zn} and Bk — {Rk/^/n > Zn}. Let 

Qn,d= Pi^k^ n ■ ■ ■ n Ak,) and Q„^rf= J2 P(Bfci n---nBfcJ. 

t„<fcl<...<fcd<S„ t„<fci<...<fcd<s„ 



By the inclusion-exclusion formula, we know for any q > 1 

2q 

i-l)''-'Qn.d<Pi 

t„<k<s„ 



V(-l)''"'Q„,d < P max \nk/V^\ > a2s„ x + b^s^ < V (-l)''"'g„,d 



2q-l 



(59) 



By Lemma 14, \Qn,d — Qn,d\ < Cp. d(log s„)^^/^(5„^rf + s^^. By Lemma 20 with elementary calculations, we 
know lim„_>.oo Qn^d = e^''^/d!, and hence lim„_>.oo Qn,d = e^'^^/d!. By letting n go to infinity first and then 
d go to infinity in (59), we obtain (58), and the proof is complete. □ 



4.3. Proof of Theorem^ 

Proof of Theorem^ We start with an m-dependence approximation that is similar to the proof of Theoremjl] 

Set m„ = [n'^\ for some < /3 < 1. Define Xi = 'Hi_,„„Xj, 7^ = E(XoXfe), and R^.k = I]"=fc+i(-^i-fe^» - 
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7fc). Similarly as the proof of Lemma 10 we have mider the condition (14) 



max \Rn,k - Rn.k\ = Op 
l<k<n 



n/log . 



For i?„.fc, we consider two cases according to whether fc > 3m„ or not. 

Case 1: k > 3m„. We first split the interval [k + 1, n] into the following big blocks of size {k — rrin) 



Hj = [k + i - l{k — m„) + 1, fc + j{k — m,^ 
Hw„ = [fc + (wn - l)(fc - rrin) + l,n], 



for 1 < j < Wn — 1 



where w„ is the smallest integer such that k + Wn{k — m„) > n. For each block Hj^ we further split it into 
small blocks of size 2to„ 

= [fc + (j - l)(fc - m„) + (/ - l)2m„ + 1, /c + (j - l){k - nin) + 2/m„] for 1 < / < vj 
Kj^y^ = [k + {vj - l){k - m„) + l)2m„ + 1, fc + {j - l){k - m„) + \Hj\] 

where Vj is the smallest integer such that 2mnVj > \Hj\. Now define Uk.j,i — X^ieif , ^i-fe^i and 



K:i= E T.Uk,, and < 

j=u (mod 3) / odd 



1= E Uk,, 

j=u (mod 3) / even 



(60) 



for u = 0, 1, 2. Observe that each i?^'^, (u = 0,1,2; o = 1, 2) is a sum of independent random variables. By 
(35), ||?7A;.j,i|l — 2K404|?7fej-./|^/^. By Corollary 1.7 of Nagaev (1979) where we take yi = ^/n in their result, 
we have for any A > 

2 

P {\RnM > 6AV«logn) < E E ( 



u=0 o=l,2 



R 



> A 



a/ n log 



^E E E^(l^fc'J.'l 
ti=0o=i,2 j,; 

2A2 



1-/3 . „/3p/4\ P\/l°g"/(P+4) 



12 exp <^ - 



(p + 4)2 . eP/2 • k2 . el 



logn i- =: /„_fc + Iln.k + /^fn,fe, 



(61) 



where the range of j, Z in the sum YTj i (60). Clearly, X]fc=3m ^^n.fc = ''(I)- Similarly as the proof of 



Lemma 



12 



we can show that X]fc=3m ^n.k = o(l). Therefore, if e = Cp/6, then X]I-=3m ^-^^n,fc = ^) 



Case 2: I < k < 3m„. This case is easier. By splitting the interval [fc + l,n] into blocks with size 4m„ 



and using a similar argument as (61 ), we have 

3rnji — 1 



lim P(\R„,k\ > Cpy/nlogi 

n—^oo ^ — ^ \ 



fe=l 



n = 0. 



The proof is complete. 



□ 
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4-4- Box- Pierce tests 

Similarly as the proof of Theorem [T] we use m-dependence approximations and blocking arguments to prove 
Theorem |4] We first outline the intermediate steps and give the main proof in Section [4.4.1[ and then provide 
proofs of the intermediate lemmas in Section 4.4.2 and Section 4.4.3 We prove Theorem [6] in Section 4.4.4 



and prove Corollary [5] and [7| in Section |4.4.5[ 
4.4-1- Proof of Theorem^ 

Step 1: m-dependence approximation. Recall that Rn,k = "Yll^k+i^-^i-kXi—^k)- Without loss of generality, 



assume s„ < [n'^J . Set m„ = 2[n^J. Let = 'Hl_^^^Xi and R^^k = J^'Lk+ii^i-k^i - %)■ By (35 1 and 



(431, we know if 84 (m) — o{m ") for some a > 0, then for all 1 < fc < s„ 

E\Rlk - i?2 < ||i?„ + R^j^W . ||i?„_fe - i?„ fell < Ce^ n • 64 (m„/2) = a (n^-"^) . 

The condition J^'kLo ^^^ii^) < °° implies that Qi{m) = 0{mr^). Therefore, under the conditions of Theo- 
rem [4j we have 

— ^Eo (rI^-RI^^op{1). 



fc=l 



Step 2: Throw out small blocks. Let Z„ — [n^l, where 77 G (/3,1). Split the interval [l,n] into alternating 



small and large blocks similarly as (41 1: 

Hj = [s„ + {j - l)(2?7l„ + In) + 1, S„ + (j - 1)(2771„ + In) + L] 1 < j < Wn 

Kj = [sn + {j - l)i2mn + In) + In + I, Sn+ .i{2mn + In)]; 1 < j < w„ - 1; and 

Kw„ = [Sn + (W„ - 1)(2771„ + In) + In + 1, , 

where w„ is the largest integer such that s„ + (wn — l)(277i„ + In) + In < n. Define Ukfi = 0, Vkfl = 
J2^eK„,^>ki^i-kX^-%), and Ukj = J2,eH,i^^-k^^~%), Vfcj = Y.^eK, i^'^-k^i ' Ik) for l<j< Wn. Set 
T^n.k = Y^J^i Uk,j- Observe that by construction, Uk.j, 1 < J < Wn are iid random variables. In the following 
lemma we show that it suffices to consider TZn,k- 

Lemma 15. Assume Xi e , EX^ = 0, and J2V=o^^^s,{k) < oo, then 

— ^Eo (Rl,~nl,)^opii). 



n^/Sn 



Step 3: Central limit theorem concerning TZn,k 's. 

Lemma 16. Assume Xi G C^, EXi — 0, and J2V=o ^^^'^ 



V " k=i \ feez / 
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We are now ready to prove Theorem [4j 
Proof of Theorem^ By Lemma \TE\ and Lemma [TBI we know 

E i^lk - ^^'l'^) ^ ( 0, 2 E 



It remains to show that 



^ Sr. 

V " fc=l 



(62) 



We need Lemma 
and hence 
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with a shght modification. Observe that in equation (91 ), we now have jy^=i B2(j)^ < oo, 



- {n - fc)(To| <G [n- fc)A4([fc/3j + 1) + ^Jn-k 



With the condition 88 (m) — o[m elementary calculations show that A4(m) — o(m ^). Then (62) follows, 
and the proof is complete. □ 



4-4- ^- Step 2: Throw out small blocks. 



Let A2 be the collection of all double arrays A — {aij)i j>i such that 



Alloc := max < supY^ |aij|, supY^ \aij\ > < 00 
»>i ~f .?>i rrf 



For A,B e A2, define AB — {J2'k'=i^ikbkj)- It is easily seen that AB e A2 and ||Ai3||oo < II A||oo||-B||oo- 
Furthermore, this fact implies the following proposition, which will be useful in computing sums of products 
of cumulants. For d > 0, let Ad be the collection of all d-dimensional array A — A{ii, i2, ■ ■ ■ ,id) such that 



l^ll. 



max 

l<i<o 



,id)\ > < 00. 



X < sup \A{ii,i2, ■ 

Note that Aq = M, and || A||oo = \A\ if A e Ao- 

Proposition 17. For k > Q, I > and d> 1, if A E Ak+d and B € Ai+d' define an array C by 

C(ji, . . . ,ifc,ife+i, . . .,ik+i) = E A{ii, ...,ik,ji,.. .Jd)B{ji,. . .,jd,ik+i,- ■ ■,ik+i) 

ji,...,jd>i 

then C e Ak+i, and \\C\\oo < ||A||oo||S||oo. 

we present an upper bound for Cov(i?„jij, Rn,h)- We formulate the lemma in a more general 



In Lemma 



18 



way for later uses in the proofs of Lemma 15 and Lemma 16 For a /c-dimensional random vector {Yi , . . . ,Yk) 
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such that ||yi||fe < oo for 1 < z < fc, denote by CvLm{Yi, . . . ,Yk) its fc-th order joint cumulant. For the 
stationary process (Xi)i^z, we write 



j{ki,k2, . ■ . ,kd) :== Cum{Xo,Xki,Xk2, - ■ ■ ,XkJ. 



We need the assumption of summabihty of joint cumulants in Lemma |18[ Lemma [15] and Lemma |16| For 
this reason, we provide a sufficient condition in Section |6] 

Lemma 18. Assume Xi e EX^ = 0, 02 < oo and X^fci fca feasz ''^a, ^s)! < oo. For k,h > 1, 



> in > and Sn e Z, set Uk = Y.''ilLii^i-kXi - jk) and Vh = YfZ^s\i(^ ] i ~ ^hen we have 



\nUkVh)\<tr,^{k,h) 

where [S(fc, /i)fc^/i>i] is a symmetric double array oj non-negative numbers such that S € A2, and 

||S||oo<2e4+ |7(fci,fe,fc3)|. 

Proof. Write 

t=i ]=i 

= ^^[7(-fc, j + s„ - i - + s„ - i) 
For the sum of the second term, we have 



In. tn 

i=l j = l 



Similarly, for the sum of the last term 

In tn 



tn-1 



(7s„+d+/c-/i7s„+rf)(in ~ d) 



d=l 



7s„+d+fe-?i7s„+d 

d=tn-ln 

- X! (7s„+£;+fe-/i7s„+<i)(^« + rf) 

d=l-ln 



<tn ^ |7s„+<i+fc-h7s„+d| 
<in ^ Cd+k-hCd- 



del 



J=l i=l 
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Observe that E^i Edez Cd+fc-hCd < (EdezCd) < ©I and similarly Er=i Edez Cd+fe+'^Cd < For the 
sum of the first term, it holds that 



i=l ] = 1 



del 



Utilizing the summability of cumulants, the proof is complete. □ 

In the proof of Lemma |15[ we need the concept of indecomposable partitions. Consider the table 

(1,1) ... (l,Ji) 



Denote the j-th row of the table by "dj. A partition v = {vi, . . . , i^q} of the table is said to be indecomposable 
if there are no sets Vi-^, . . . , Vi^, [k < q) and rows 'dj^ ,...,19^; (/ < /) such that i/^j U • • • U i^i^ = U • • • U . 

Proof of Lemma \15\ Write 



k=l 



fc=l 



k=l 



Using Lemma 16 we know //„/(nys^) — op(l). We can express J„ as 

1 1 

In — IfiMb = In.QO + ^n,01 + In.lO + 

a=0 b=0 

where for a, 6 = 0, 1 (assume without loss of generality that w„ is even), 

In,ab = ^1^0 I ^ Uk^2j-a ^ ^fc,2i-b 
fc=l \ 3=0 ]=0 



Consider the first term in (63), write 



k,h=l 



E Eo(C/fe,2jT4,2j) •Eo(t/,.,2j-l4^2j) 



+ E E E(C/fe,2,,f//.,2jJE(^fe,2,,n,2jJ 
+ E E E(C/.,2,,n,2,JE(F,,2,,;7;,,2,J 

/c,/l=l jl#j2 

:= An + -Bri + C'n- 



By Lemma [TS} it holds that 



|Sn| < E E 



S(/c,/i) 
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(63) 



Sn 2 



k,h=l 



where ^„(A;,/i) is the 'E,{k,h) (defined in Lemma 18 1 for the sequence (Xi). Similarly, 



k,h=l ji j2 = l 



S„(fc, ft.) 



k.h=l 



To deal with A„, we express it in terms of cumulants 



= X! [Cum{Uk,2j, Vk,2j, Uh,2j, Vh,2j) 

k,h=l j = l 

+E{Uk,2jUh,2jMVk^2jVh,2j) 
+E{Uk^2jVh,2jMVk^2jUh,2j)] 
= ■ Dn + En + Fn- 

Apparently |i?„| = o(n^s„) and \Fn\ — o(n^s„). Using the multilincarity of cumulants, we have 
Cum{Uk,2j,Vk,2j,Uh,2j,Vh,2j) = E E Cum(Xi 



32) 



for 1 < fc, ft < s„. By Theorem II. 2 of Rosenblatt ( 1985 1, we know 

b 

Cum (^Xj,_fcXi,,Xj,_fc^ji,X,2-/»X,2,Xj,_,,Xj,) = XI n Cuni(Xj, i e Vq) (64) 

q—l 

where the sum is over all indecomposable partitions v ~ {vi, . . . ^Vq\ of the table 

ii — k ii 

ji - k ji 
12 -h 12 
32 - h j2 



By Theorem 21 the condition X^fcLo k^^si^) < oo implies that all the joint cumulants up to order eight are 
absolutely summable. Therefore, using Proposition |17[ we know 

£ |Cum(C/fc,2„ 14,2,, U^,2j, V^,2j)\ = 0(1^^2,14), 

k,h=l 

and it follows that |Z3„| = O {{wnirin + ln)sn) — o(n^s„). We have shown that £{1^ oo) — o(n^s„), which, in 
conjunction with similar results for the other three terms in (63), implies that E(/^) — o(n?Sn) and hence 
In/ {n^/s^i) — op(l). The proof is now complete. □ 
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4-.4.3. Step 3: Central limit theorem concerning TZn,k 's- 



Proof of Lemma \l6[ Let T„(fc,ft,) :~ E{Uk.iUh.i) and Vn{k,h) := T„(/c, By Lemma 18 we know 
|u„(fc,/i)| < S„(fc,/i). Write 



i-1 



J2 {uIj - T„(fc, fc)) + 2 ^ Uk,, Uk,. 



fe=i 



E 



j=l Lk=l 



/=1 



£(C/2,.-T„(fc,fc)) 



Using similar a argument as the one for dealing with the term An in Lemma 15 we know 



E 



k=l 



and it follows that 



n^/s. 



£(C/fe2,.-T„(A:,fc)) 



.fc=i 



-op(l). 



Therefore, it suffices to consider 



E E^^'^.^E^^M =:E^"..- 
j=i \fc=i 1=1 / j=i 

Let t^nj — (-D„.i, . . . ,Dn,j)- Observe that (Dnj) is a martingale difference sequence with respect to {Q„ 
We shall apply the martingale central limit theorem. Write 

k,h=l 1 = 1 / 

= "^nik, h) (j2 Uk,iUH.i - (j - l)T„(fc, hU 

k,h=l \l = l ) 

+ Y ^»(^' ^) E E ^'^'9 + E E ^'^.^ 

\/=l g=l i=l q=l / 



For the first term, by Lemma [TSj we have 

2 



E J 



itl,, — 1 



E(^""j)' 



3 = 1 



Y\'^n{kM\\{Uk,3Uh,,-Tn{k,h))\\ 



k,h 



Y\Mk,h)\-4el 

kji 
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Using Lemma 18 and Proposition 17 we obtain 

2 



Wr, — 1 



J-1 



J-1 



E - E ^"(^' ^) ^fc.^- E ^'M + f^/^J E ^'^.^ 



/=1 



= 2 ^ {Wn - j)"^ {j ~ 1) E T„(fci, /li)T„(A;2,/l2) [T„(fci, fc2)T„(/li,/l2) + T„(fci,/l2)T„(/li,fc2)] 

Therefore, we have 



4o. 



Using Lemma [TS] and Lemma [24j we know 



k,h=l 



and it follows that 



fcez 



To verify the Lindeberg condition, we compute 



fci ,/C2 1^3 5^4 — 1 



X E 



L \i=l 



\l=l / 



< ^ |E(t/fc,^,C/fe„,[/fc3,,C/fc,^,)|-2C440--l)2/2e8 

fci ,/C2 ,^3 ,fc4 — 1 

We express E([/fcj,iJ7fc2^i[/fc3_iC/fe^,i) in terms of cmnulants 

HUk,,iUk,,iUk,.iUk,,i) = Cnm{Uk,^,Uk,,i,Uk,A,Uk,A)+E{Uk,,iUk,.i)EiUk,,iUk,,i) 

+ E{Uk,,iUk,,i)E{Uk,,iUk,s) + E{Uk,,iUk,,i)E{Uk,,iUk,,i) 



(65) 



From Lemma [Tsj it is easily seen that 

E \Bn\<ll E S„(fci,fc2)-S„(fc3,fc4)=0(/^^), 

All ,^2 ,^3 ,^4 — 1 ^1 ,^2 ,^3 ,^4 — 1 

and similarly T,tlMMM=i 1^"' = and Efc" fc2,fc3,fc4=i \^^\ = 0{llsl). By multilinearity of cumu- 

lants, 
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il ,12 ,»3 ,U = 1 



Each cumulant in the preceding equation is to be further simphfied similarly as (64). Using summability of 



joint cumulants up to order eight and Proposition 17 we have 



fcl ,/C2 1^3 :^4 — 1 



Using orders obtained for \A„\, |-B„|, \En\ and |F„|, we obtain J2j=i^^n,j — "(^i Then, by (65), we 



can apply Corollary 3.1. of Hall and Heyde (1980) to obtain 



V ' " j=i \ feez / 



and the lemma follows. 



□ 



4-4-4- Proof of Theorem^ 



Proof of Theorem^ We shall only prove (22), since (21) can be obtained by very similar arguments. Write 

7fc = Eo7fe + 7fe - (7fc - E7fc), and hence 

Y^if, - 7fe) - 2|]7fcEo7fc + E(Eo7fc)' - 2!] ^7feEo7fc " 2^ ^7fc + £ ^^fc 

fe=l k=l k=l k=l k=l k=l 

= : 2/„ + //„ + ///„ + IVn + Vn. 

Using the conditions 64 < 00 and s„ = o(y'n), it is easily seen that — > and y/nVn — >■ 0. Furthermore 



V^||J//„|| < 2V^y -|7fc| • — 4 ^ and v^E//„ < V^V 



4ei 



fc=i 



Define Yi — X]fc°=i IkXi-k- For the term /„, write 



n/„ = ^ Eo(X,r,) - ^ Eo ( X, ^ 7fcX,_fe ) + ^ 7fe ( ^(^.-fc^^ - 7fc) 



1=1 



i=l 



fc=l \j=l 



Clearly ||S„||/^ < ^^'^^ |7fe|2el\/fc/yr^ ^ 0. Define VK„,, = X, EZs„+i lkX,-k, then 

'54(i) • 04l]fcLs„+i l7fc| if 0<i<s„ 



It follows that 



©4<54(i)E^s„+i l7fc| + 04ELs,.+i l7fe|^4(«- fc) if i > s„. 



\Bn/V^\\<2el l7fcl^0. 



Set Zi = X^y^, then (Zj) is a stationary process of the form (|9|. Furthermore 

00 i 

\\V°Z,\\ < Si{{) • 94^^ |7fe| + 04 ^ bkMi - k). 



k=l 



k=l 



Since X)i^o utilizing Theorem 1 in Hannan (1973) we have A^j \pn =4> A/'(0, H-DolP), and then 



(22) follows. 



□ 
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4.4-5. Proof of Corollary^ and^ 



Proof of Corollary^ and^ By (33), we know ||riX„||4 < ■\/3n04, and it follows that 



{Xi-k — Xn){Xi — Xn) — Xi-kXi 
i=k+l i=k+l 



< 901 



Theorem [4] holds for 7^ because 

2 E I - E7fc)2 - (7;, - E%)^ I < ^ £ ||7fc + 7fe - 2E7fe|| • ||7fe - % \ 



n 



k = l 



< 



k = l 



E 

k=l 



49? 99? \ 99 



/n n 



In Theorem [6] ([22]) holds with 7^ replaced by 7^ because 

Sn Sn / 

x/?i^E|72-72| < V^£||7fc + 7fe|| • \\lk-%\\ < VnY, + ^ 

k=l k=l k=l ^ ^ 



491 99l\ 904 



0, 



and (21 1 can be proved similarly. Now we turn to the sample autocorrelations. Write 

2(Eo7fe)[7fc(7o-7o)] , 7fe(7o-7o)^ 



J2{[fk-il- k/n)r,f ~ [7fc/70 - (1 - k/n)r,f} = ^ 



fe=i 



k = l 



7o7o 



7o7o 



Since 



£E|(Eo7fe)7fe(7o-7o)|<E2C392^. (17^1 + 20392^) •2C392^ 



k=l 



fe=l 



and similarly X]fe=i ^ 7^(70 ~ 7o) = o{y/J^/n), (19) follows by applying the Slutsky theorem. To show the 



limit theorems in Corollary [7) note that using the Cramer- Wold device, we have 



V"(7o - 7o). V" E ^fc ~ E 



\fe=i k=i , 

converges to a bivariate normal distribution. Then Corollary [7] follows by applying the delta method. □ 



5. A Normal Comparison Principle 

In this section we shall control tail probabilities of Gaussian vectors by using their covariance matrices. Denote 
by (pci{{rij); xi, . . . , xj) the density of a c?-dimensional multivariate normal random vector X — (Xi, . . . , Xci)^ 
with mean zero and covariance matrix (r^j), where we always assume ra — 1 for \ < i < d and {rij) is 
nonsingular. For 1 < ft, < Z < d, we use Lp2{{rij)] Xh — Xh,Xi = xi) to denote the marginal density of the 
sub-vector {Xh,Xi)^ . Let 



{d{{'rij)]Zi,...,Zd) = 



00 pOO 



ipdiirij),xi, . . . ,Xd) dxd--- dxi 
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The partial derivative with respect to r^i is obtained similarly as equation (3.6) of Berman (19641 by using 



equation (3) of |Plackett| ( |1954[ ) 

dQdiirij);zi,...,Zd) 



dr, 



hi 



(H / \ fd{{rij);xi,...,Xh-i,Zh,Xh+i,...,xi-i,zi,xi+i,...,Xd) Y\_ '^^k 



(66) 



where (771,^1, ; f°° 1 stands for r°° • • • f°° f°° ■ ■ ■ f°° f°° ■ ■ ■ f°°.U all the zi. have the same value z, we 
use the simplified notation Qd ((?'ij); z) and dQd{{rij); z)/drhi. The following simple facts about conditional 
distribution will be useful. For four different indicies 1 < h,l,k,m < d, we have 

Tkh + Tkl 



^{Xk\Xh = Xi = z) 



l + na 



Var(X,|X, ^X,^z)^ I - rh - rj, - rl, + 2r,,r„,r,, 



Cov{Xk,Xm\Xh = Xi= z) = rkm - 



rhkrhm + rikTim - rhiThkrim - rhirhmrik 



(67) 
(68) 
(69) 



Lemma 19. For every z>0, 0<s<l, d>l and e > 0, there exists positive constants C'd and such 
that for < e < ed 



1. if \rij\ < e for all 1 < i < j < d, then 



Id ((ry );z) < Cdcxp 



Cde]z 



:{{rij);z,...,z) < Cdfd{<^A/z) exp 



^ - Crfe 1 z^ 



Qd{{rij);sz,z,...,z) < Cdexp 



s'^ + d-l 



Cde z 



(70) 
(71) 
(72) 



where f2k{x,y) ^J2i=oX y 



I 2(k-l) 



id f2k-i{x,y) = J2i=o ^ y 



- ™/,,2(fc-0-l 



for k > 1; 



2. if for alll<i<j<d+l such that {i,j) =/= (1, 2), \rij\ < e, then 

Qa+i {{n,);z) < Cdexp S^- (^(i^M!±^ _ 
Proof. The following facts about normal tail probabilities are well-known: 



Cde z 



P{Xi >x)< 



1 



2ttx 



1'^ for a; > and Hm 



P(Xi > x) 



oo (l/a;)(27r)-i/2exp{-a;72} 



(73) 



(74) 



By (74), the inequalities (70) - (72) with e = are true for the random vector with iid standard normal 
entries. The idea is to compare the desired probability with the corresponding one for such a vector. We first 



prove (70) by induction. When d = 1, the inequality is trivially true. When d = 2, by (66), there exists a 
number r'yi between and ryi such that 



IQ2((ry);2)-Q2(/2;2)| < (^((r^),z,z)|ri2| 
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< c 



cxp{-^}<CexpHl-.).^}, 



which, together with Q2{l2]z) < Cexp{— 2;^}, imphes (70) for d = 2 with 62 = 1/2 and some C2 > 1. Now 
for > 3, assume (70) holds for all dimensions less than d. There exists a matrix (r'^j) = d{rij) + (1 — 0)1^, 
for some < 6* < 1 such that 



'.iirijy,z)-QdiiId;z)= 



l<h,l<d 



— {{r^j)-z,...,z)rhi. 



hi 



(75) 



By (167|, ¥.{Xk\Xh ^ Xi ^ z) < 2e'z/{l - e') for k ^ h,l. Therefore, by writing the density in ([66]) as the 
product of the density of {Xh, Xi) and the conditional density of /} given X^ — Xi ^ z, where ;} 

denotes the sub- vector {Xi, . . . , X^-i, X^+i, . . . , . . . , Xd)^ ; we have 

dQd. 



drh 



-{{4A;z,...,z) 



(76) 



where {r'^.^^^) is the correlation matrix of the conditional distribution of X_j^j^ i^ given Xh and Xi. By (68) 
and (691, we know for k,m £ [d] \ {h, 1} and k ^ m, 

+ 



Ya.T{Xk\Xh ^Xi = z)>l-3e'~ 2e^ and Cov{Xk,Xm\Xh ^ Xi ^ z) < 



1 -e 



Therefore, all the off-diagonal entries of (r'^^j^i^i) are less than 2e if we let e < 1/5. Applying the induction 
hypothesis, if 2e < ed-2, then 

'd-2 



Qd-2i{K^lJ; (1 - 3e)z) < Cd-2 exp |- (^^ 



- 2Cd^2e (1 - SeYz' 



and equation ( 76 ) becomes 
dQd 



.,z) 



< CQ_2exp{-(l-e)z2} .expl- 



ri- 2 



- (2Cd-2 + 3(d-2))e z 



Therefore, (70 1 holds for < min{l/5, 6^-2/2} and some Cd > 2Cd_2 + 3(d - 2) + 1. 



Using very similar arguments, inequality i72K can be proved by applying (70 1; and inequality (73) can be 



obtained by employing both ( 70 ) and ( 72 1 . To prove inequality ( 71 1 , which is a refinement of ( 70 ) , it suffices 



to observe that, by ([741 , ([751 and ^ 

Qd{{nj);z)<Qd{Id;z)+ C'e exp{-(l - e)z2}g^_2(«,|w); (i - 3e)z) 

l<h,l<d 
.2 ■ 



< 



1 r ^^2 ^ 

C^d^cxp — +Qeexp{-(l-e)z2} ^ g,_2((r:,|„); (1 - 36)z); 



and apply the induction argument. 



□ 



Lemma 20. Let (X„) he a stationary mean zero Gaussian process. Let = Cov{Xq, X^.)- Assume tq = 1, 
and lim„_j.oo ''n (log n) = 0. Let an = (21ogn)^^/^, fe„ = (21ogn)^/^ — (Slog n)^-*^/^ (log log n + log 47r), and 
Zn = 0-nZ + bn for z G M. Define the event Ai — {Xi > z„}, and 

Qn,d^ Y n • • • n A,J. 

l<'ii < . . ■<id'^'>^ 
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Then lim„_>.oo Qn.d ~ e '^^ /d ! for all d > 1. 

Proof. Note that = 21ogn — log log n — log(47r) + 2z + o(l). If [Xn] consists of iid random variables, by 



the equality in (74), 



lini Q„,d = Jim ( ^ ]Qd{Id,Zn 



n— >-oo 



When the X^s are dependent, the result is still trivially true when d = 1. Now we deal with the d > 2 
case. Let 7^. — sup^^j, \rj\, then 71 < 1 by stationarity, and lim„^oo 7n log — 0. Consider an ordered subset 
J = {i, f + Zi, t + + ^2, ■ • • , ^ + ^1 + ■ ■ • + ^d-i} C [n], where li, . . . , Id-i > 1- We define an equivalence 
relation ^ on J by saying fc ^ j if there exists fci, . . . , /cp G J such that k = ki < k2 < ■ ■ ■ < kp = j, and 
kh — kh-i < L for 2 < h < p. For any L > 2, denote by s( J, L) the number of Ij which are less than or equal 
to L. To similify the notation, we sometimes use s instead of s{ J, L). J is divided into d—s equivalence classes 
Bi, . . . , Bd-s- Suppose s > 1, assume w.l.o.g. that \Bi\ > 2. Pick /cq, ki e Bi, and kp ^ Bp for 2 < p < d — s, 



and set K = {ko, fci, fc2, . . . , fcd-s}- Define Qj = P{r\kejAk) and Qk similarly, then Qj < Qk- By (73) of 
Lemma [l9j there exists a number M > 1 depending on d and the sequence (7fc), such that when L > M, 

Qk < Cd-se.J-( ^'-^'^l + '-' -Cd-.,,)zl 



< Cd-s exp 



d-s , (I-71)' 



2 

2 



2 3 

Note that z"^^ = 21ogn — log log n + 0(1). Pick L„ — max{[n°J,M} for some a < 2(1 — 'jf)/3d. For any 
1 < a < d— 1, since there are at most L'^n'^^"' ordered subset J C [n] such that s{J,Ln) — a, we know the 
sum of Qj over these J is dominated by 

r Ji , , 2(d-l)(l-7i)2 2(1 -71)^ 
Crf_a exp <^ log n ( (d - a) H ^^-^ (d - a) 

when n is large enough, which converges to zero. Therefore, it suffices to consider all the ordered subsets J 
such that Ij > Ln for all 1 < j < d — 1. 

Let J = {ti, . . . ,trf} C [n] be an ordered subset such that ti — > L„ for 2 < i < d, and J'(d, L„) 
be the collection of all such subsets. Let (r^j) be the d-dimensional covariance matrix of X j. There exists a 
matrix Rj = 9{rij)ij(zj + (1 — d)Id for some < 6' < 1 such that 

Qj - QdiId,Zn) = W^[Rj'^Zn\r,j. 

UThl 

h,leJ,h<l 

Let Rh, H — J\ {h,l}, be the correlation matrix of the conditional distribution of Xh given X^ and Xi. 
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By ( 71 ) of Lemma 19 for n large enough 

dQd, 



drhi 



[Rj;zn] < C exp 



1 + li-h 

< CCd-2fd-2ilL„A/Zn)exp 

d-2 



Qd-2 {Rk; (1 - 37l„)z„) 

~2 



It follows that 



X exp 

< Cd fd-2{lL„ A/ Zn)eTip 

< Cdfd^2ilL„A/ Zn) exp 



X] \Qj ~ Qd{ld;z„) 

JeJid,L„) 



1 + ll-h 

-2Cd-2lL,, ] {l-3^Ljfz^ 
d 



{2Cd-2 + 3(d - 2))7i„ 

C<i7L„ - Ih-l I zl 



Ih-i z. 



CdlL^ -lt,-U Z^ \lt,-t 



<Cdfd-2hL„,l/Zn) X! °^P] 

JeJ{d,L„) l<i<j<d ^ 

= Cdfd-2{lL„,l/Zn) (^~(^dlL,,-lt,-Uj zl\-it,-U- 



For each fixed pair 1 < i < j < the inner sum in ( 77 ) is bounded by 

n-l 



'■d7L„ - li\z^>ii 



Cdfd-2ilL„,l/zn) Y (n-iy-'expLf^-Cd 
<Cdfd-2{lL,,,l/zn){\ognf/^n-^ J2 (n-0'"'exp{(Cd7L„ +7021ogn}7i 

i=L„+l 

<Cd/d-2(7L«°J,l/2n)7Ln°j(logn)'^/2exp{2(Cd + 1)7l„=j logn} . 



(77) 



(78) 
(79) 



Since lim„_yoo 7™ log n = 0, it also holds that lim„_j.oo 7^„qj logn = 0. Note that lim„_>.oo(logn)"'^^^/z„ 



2^1/2^ it follows that lim„^oo /d-2(7Ln°j , l/2„)(logn)''/2-i = 2"'^/2+i_ Therefore, the term in (79) converges 
to zero, and the proof is complete. □ 



Remark 3 . This lemma provides another proof of Theorem 3 . 1 in Barman ( 1964 ) , which gives the asymptotic 
distribution of the maximum term of a stationary Gaussian process. They also showed that the theorem is 
true if the condition lim„_>.oo ''n log n = is replaced by ''n < Under the later condition, if we 



replace 7*^-*^ by |rt^._t.| in (77), 7; by \ri\ in (78), then the term in (78) converges to zero, and hence our 
result remains true. 

Remark 4. In the proof, the upper bounds on Qj and \Qj—Q{Id] Zn) \ are expressed through the absolute val- 
ues of the correlations, so we can obtain the same bounds for probabilities of the form P{ni<i<d{{~l)"'^ Xt^ > 
Zn}) for any (ai, . . . ,ad) G {0, l}"*. Therefore, our result can be used to show the asymptotic distribution of 
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the maximum absolute term of a stationary Gaussian process. Specifically, we have 



lim P max \Xi\ < a2nX + b2n ] ~ exp{— exp(— x)}. 

n— 5-00 V l<i<n 



Deo (19721 obtained this result under the condition lim„^oo ''n(logfi)^^" = for some a > 0, whereas we 
only need lim„_>.oo log n = 0. 



6. Summability of Cumulants 

For a fc-dimensional random vector (Yi, . . . , Y^) such that < oo for 1 < i < fc, the fc-th order joint 

cumulant is defined as 



j=l \ iSi/j 



(80) 



where the summation extends over all partitions {i^i, . . . , Vp} of the set {1, 2, . . . , fc} into p non-empty blocks. 
For a stationary process {Xi)i^j,, we abbreviate 

7(fci,fc2,...,fcd) := Cum{Xo, Xki, Xk2, ■ ■ ■ , Xka), 

Summability conditions of cumulants are often assumed in the spectral analysis of time series, see for example 



Brillinger (2001 


I and 


Rosenblatt 


( 


1985 



(2008) in studying the spectral properties of banded sample covariance matrices. While such conditions are 



true for some Gaussian processes, functions of Gaussian processes (Rosenblatt 1985), and linear processes 



with iid innovations ( Anderson 1971 ) , they are not easy to verify in general. Wu and Shao ( 2004 ) showed 
that the summability of joint cumulants of order d holds under the condition that Sd{k) = 0{p^) for some 
< p < 1. We present in Theorem [21] a generalization of their result. To simplify the proof, we introduce the 
composition of an integer. A composition of a positive integer n is an ordered sequence of strictly positive 



integers {vi, U2, . . . , Vq\ such that vi 



n. Two sequences that differ in the order of their terms 



define different compositions. There are in total 2" ^ different compositions of the integer n. For example, 
we are giving in the following all of the eight compositions of the integer 4. 

{1,1,1,1} {1,1,2} {1,2,1} {1,3} {2,1,1} {2,2} {3,1} {4}. 

Theorem 21. Assume d>2, X,e C'^+^ and EX, = 0. // 

OO 

Y.k'^-'Sd+iik) (81) 

then 



fe=o 



X] h{ki,k2,.-.,kd)\ < oo. 

All ,. . . 
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(82) 



Proof. By symmetry of the cumulant in its arguments and stationarity of the process, it sufhees to show 

^ |7(fcl,/C2,---,fcd)| < oo. 

0<i;i<fc2<---<fcc! 

Set X{k,j) := HjXk, we claim 

7(fci, A:2, . . . , fed) = ^ Cum [Xo, XihA), X{K^^i, 1), Xk^^ - X{K^ , 1), 

+ 1)7 -'^fei,, - X{k^^,km + 1), 



+ l),...,X(ka-uk^^+l),Xk,-X{ka,k^^+l)\ ■ (83) 

where the sum is taken over ah the 2''"^ increasing sequences {wq, Ui, . . . , fg, "i^g+i} such that uq = 0, Wq+i = d 
and {vi^V2 — vit ■ ■ TVq — Vq^i^d — Vq\ is a composition of the integer d. We first consider the last summand 
which corresponds to the sequence {uq = 0, ui = d}, 

Cum [Xo, X(fci, 1), . . . , 1), Xfe, - 1)] 

Observe that Xo and (X(fci, 1), . . . , X{kd-l^ 1)) are independent. By definition, only partitions for which Xo 



and Xfe^ — X{kd^ 1) are in the same block contribute to the sum in (80). Suppose {vi, . . . , Vp\ is a partition 
of the set {fci, /c2, . . . , kd-l\^ since 





E 



x^{Xk,-x{kdA)) n 



j = -oo 




p.xopjXfe, n x(fc, 1) 



it follows that 
E 

and therefore 



Xo{Xk,-x{kdA)) n 



J j=2 \ kevj 



<Y.^d+i{j)&d+i{kd+ i)4+\ 

]=0 



J2 |Cum[Xo,X(fci,l),...,X(fcrf_i,l),X,, -X(fcd,l) 

0<fei<fe2<---<fcd 



00 00 



/k + d— 1^ 



j=0 /c=0 



d- 1 



0</ci<fe2<---<fcd j = 

provided that X^feLo k'^^^6d+i{k) < 00. 

The other terms in ( |83[ ) are easier to deal with. For example, for the term corresponding to the sequence 

{vq ~ 0,vi ~ 1, W2 ~ d}, we have 

|Cum [Xo, Xk, ~ X(fci, 1), X(fc2, fci + 1), . . . , Xikd-uki + 1), X^, - X(fcd, ki + 1)]| 
< CdKd+l'i'd+i{ki)'bd+i{kd - fci). 
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Since J2T=o ^ ^^d+i{k) < oo implies J2T=o ^ < oo, it follows that 

J2 |Cum [Xo, Xk, - Xih, 1), X{k2,h + 1), . . . , X{kd-i,ki + l),Xk, - fci + 1) 

0</£l</£2<---<fcd 

°° °° ^ rl — 0\ 



fc=0 



fc=0 



d-2 



We have shown that every cumulant in (83) is absolutely summable over < ki < ■ ■ ■ < kd, and it remains 



to show the claim (83). We shall derive the case d — 3, (83) for other values of d are obtained using the same 



idea. By multilinearity of cumulants, we have 

7(fci,fc2,fc3) ^Cuui{Xq, Xk^, Xk2, Xk^) 

= Cum[Xo,Xk,- X{k,,l),Xk,,Xk,] 

+ Cum [Xo, l),Xk, - Xik2, l),Xk,] 

+ Cum [Xo, X(fci, 1), X(fc2, l),Xk, - X{k3, 1)] 

+ Cum [Xo, 1), X(fc2, 1), X(fc3, 1)] . 

Since Xo and {X(ki, l),X{k2, l),X(fc3, 1)) are independent, the last cumulant is 0. Apply the same trick for 
the first two cumulants, we have 

CmJi[Xo,Xk,-X{ki,l),Xk,,Xk,] 

= Cnm[Xo,Xk, - Xiki,l),Xk, - X{k2,ki + l),Xk,] 

+ Cum [Xo, Xk, ~ X(fci, 1), X(fc2, k, + 1), Xk, - ^(fcg, k, + 1)] 

+ Cum [Xo, Xk, ~ X{ki, 1), X(fc2, fci + 1), X{k3, fci + 1)] 

= Cnm[Xo,Xk,-Xiki,l),Xk,-X{k2,ki + l),Xk,-X{k:,,k2 + l)] 

+ Cum [Xo, Xk, - X{ku l),X{k2, ki + 1), Xk, - X^ks, fci + 1)] 

and 



Cum [Xo,Xiki,l),Xk,- Xik2,l),Xk,]^Cum[Xo,Xiki,l),Xk,- X{k2,l),Xk,- X{k3,k2 + 1)] 
Then the proof is complete. 



□ 



Remark 5. When d ~ 1, (81) reduces to the short-range dependence or short-memory condition Q2 



SfcLo'^2(fc) < OO- If ©2 = 00, then the process (Xi) may be long-memory in that the covariances are not 
summable. When d > 2, we conjecture that ( [sT] ) can be weakened to O^+i < 00. It holds for linear processes. 
Let Xk = Y,°loai^k-f Assume Cfc G and J2T=o < ^^'^'^ Sd+i{k) = |afc| ||eolU+i- Let Cumd+i(eo) 

be the (d + l)-th cumulant of eo. Set ko — 0, by multilinearity of cumulants, we have 



j{ki,...,kd) = rQat^ ] Cum(e_t„,efci_ti,...,efc^_tJ 

to,ti,...,td>0 \j=0 
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Therefore, the condition Qd+i < oo suffices for (82). For a class of functionals of Gaussian processes, Rosen- 



blatt (19851 showed that (82 1 holds if J2k=o < oo, which in turn is iniphed by Bd+i < oo under our 



setting. It is unclear whether in general the weaker condition Qd+i < oo implies (82). 



7. Some Auxiliary Lemmas 



Suppose that X is a d-dimensional random vector, and X ^ A/'(0, S). If E = /d, then by (74), it is easily seen 
that the ratio of P (z„ — c„ < IXj, < z„) over P (|X|, > z„) tends to zero provided that c„ — > 0, z„ — > cxi 
and c„z„ — 0. It is a similar situation when E is not an identity matrix, as shown in the fohowing lemma. 



which will be used in the proof of Lemma 14 



Lemma 22. Let X ^ A/'(0, E) be a d- dimensional normal random vector. Assume E is nonsingular. Let 
Ag and \\ he the smallest and largest eigenvalue of E respectively. Then for 0<c<(5<l/2 such that 
A := (27rA?)(''-i)/2;^gc25-2 + d(5exp{(V6dAi + Ao)/A3} < 1, then for any z e [1,5/c], 

P{z-c< \\X\\, <z)< {l~ A)-^AP{\\X\\, > z). (84) 

Proof. Let Q = {Qdf/'^Xi/ Xq. Since Ag is the smallest eigenvalue of E, 



P(||X||.>z-c) > (27rdet(E))-'^/2gxp<j- 



J d{z + l) 
. 2A; 
Adf 



^2 





2A2c2 



> (27rA?)-'^/2g^p|_ 

Since P(|lX|loo > CdS/c) < d{2TTXl)-^/'^ exp{6d6^ /{2Xlc^)}, we have 

P{\\X\\^ > Cd5/c) < (2^A?)('^"i)/2^2^2^-2 p(||j^||^ >z-c). (85) 

For < fc < [1/(JJ, define the orthotopes Rk = [z + {k — l)c, z + kc] x [z — c, C(j(5/c]'^~^. For two points 
X = (xi, . . . , Xd) e Rq, Xk = {xi + kc, X2,. . ■ , Xd) & Rk, we have xJ.Y.^^Xk — x^T,^^x < {2^/dCd + 1)/Aq, 
and hence P{X e Rk) > exp{-{VdCd + l)/Xl}P{X e i?o) for any I < k < [1/S\. Since the same inequality 
holds for every coordinate, we have 

P{z-c< \\X\\, <z, ||X||oo <CdS/c) <dSexp{iVdCd + l)/Xl}Pi\\X\\, > z - c) (86) 



Combine (85) and (86), we know P (z - c < ||X||. < z) < A • P(||X||. > z - c). So (84) follows. □ 



The preceding lemma requires the eigenvalues of E to be bounded both from above and away from zero. 
In our application, E is taken as the covariance matrix of (Gfc^ , Gfe^ , . . . , G^^)^, where (Gfc) is defined in 
([6]). Furthermore, we need such bounds be uniform over all choices of fci < fc2 < • • • < fc^. Let /(w) = 
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{2tt) ^J2hez'^hC0s{huj) be the spectral density of (Gk)- A sufficient condition would be that there exists 
< m < A/ such that 

m < f{uj) < M, for u G [0, 27r], (87) 
because the eigenvalues of the autocovariance matrix are bounded from above and below by the maximum 



and minimum values that / takes respectively. For the proof see Section 5.2 of Grenander and Szego (19581. 
Clearly the upper bound in (87) is satisfied in our situation, because X]/iez I'^'il < oo. However, the existence 



of lower bound in ( |87[ ) rules out some classical times series models. For example, if (Gk) is the moving average 
of the form Gk = [rjk + '7fc-i)/V2, then f{uj) = (1 + cos(w))/27r, and /(tt) — 0. Nevertheless, although the 
minimum eigenvalue of the autocovariance matrix converges to inftjg[o,27r] f{^) the dimension of the matrix 
goes to infinity, there does exist a positive lower bound for the smallest eigenvalues of all the principal sub- 
matrices with a fixed dimension. 

Lemma 23. If'^hez '^h ^ then for each d> 1, there exists a constant Cd > such that 

mi A,„in {Gov [{Gk,,Gk,, . • . , Gk.V] } > 

fcl <k2<---<kd 

Proof. We use induction. It is clear that we can choose {Cd) to be a non-increasing sequence. Without loss 
of generality, let us assume ki ^ I. The statement is trivially true when d — 1. Suppose it is true for 
all dimensions up to d, we now consider the dimension (d + 1) case. There exist an integer Nd such that 
Y,h=Na < 2C^/(d + 1). If all the differences fc^+i - fc^ < A'^^ for 1 < i < d - 1, there are N^^^ possible 
choices of fci = 1 < fc2 < • • • < fc^. Since the process {Gk) is non-deterministic, for all these choices, the 
corresponding covariancc matrices arc non-singular. Pick > to be the smallest eigenvalue of all these 
matrices. If there is one difference fci+i — /c/ > Nd, set Si = Gov[(GfcJi<i<;] and E2 = GoY[{Gki)i<i<d\, then 
Aniin(Si) > Cd and Ainin(S2) > Cd- It follows that for any real numbers ci, C2, . . . , such that Ylt^i ~ 

^ CiCj Coy {Gki.Gk^) = 
i<ij<rf 




2 

Setting Cd+i = min{Cci/2, C^}, the proof is complete. □ 
The following lemma is used in the proof of Lemma |13[ 
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Lemma 24. Assume Xi G , EXq = 0, and O4 < 00. Assume In — >■ 00, A:„ — >■ 00, m„ < [fc„/3j and ft, > 0. 
Define Sn,k = Z]'=i(-'^»-fe^i " 7fe)- 

|E (5'„,fc„S'„,fe„+h) - <Jh\ < el (l6A4(m„ + 1) + 6eWmn/ln+ ^^iirhn + 1)) ■ 

Proof. Let Xi — 'Hl_^^Xi, then Xi and are independent, because m„ < [fc„/3j. Define Sn,k 



^i'^i Xi-kXi. By (40 1, we have for any fc > 0, 

{,Sn,k- Sn,k)/\/ln < 4^4 A4 (to„ + 1) . 



(89) 

By (35), IjS'n^fe/v^ll < 2K484 for any fc > 0, and it foUows that 

\^{Sn,k„T Sn,k„+h) — E(<5'n,fc„ <5'n,fc„+/i) | 

< ||'S'n,fe„ — 'S'n,fe„ II • |i'5'n,*:„ + ft || + ||'S'n,fc„ || ' ||5'„^fc„+;i — iSn^fej^-i-ft || (90) 

< 16^„K404A4(?7l„ + 1). 

For any fc > 3to„, define M^.k = Yl)=i Dj, where Dj = I]i="" ^i^k'P^Xi = X]^"o Xj+q^k'P^ X^+q. Observe 
that Xj+q and Xj+q-k are independent, we have 



|'S'„,fc — Mn,k 



J2 J2 ^^-k'P'^^ ^^-k'P'^, 



< 



j+m„ 
j — l — rhn i—1 



(91) 



According to the proof of Theorem 2 of Wu (2009), when fc > 37n„ ||M„^fe/>A^II^ = X^feez^fc' where 



Ik 



EXoXfc. By ([34]) and ([37]), |7fe| < Ck\ and hence 



\M. 



n.kh/n 



<Y.^l=Y. ('^2(j)^2(/)E^2(j+fc)<52(/ + fc)) 



< E ^2(j>2(/)*^ < e^M. 



(92) 



By (35) and (37), H^n^fe/v^H < 2^464 for any fc > 0. Combining (91) and (92), we have 



|E(5„,fc,.^„,fc„+h) -E(M„,fc„M„,fe„+,0| < (2«;4e4 + 62*2) V^n ■2^262 (93) 
Observe that when fc„ > 3m„, Xq^kn^q'-k^-h and V'^XqV'^Xqi are independent for < q,q' < m„. 
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Therefore, 



vq,q'=o 



q,q'=Q 



feGZ g'GZ 

fcez g'ez 

^ Ik+hlk- 



By (38), |7fc - 7a;| < 2k2*2("i + 1). Since |7fe| < (k and |7fc| < Cfc, we have 



C/i — ^ ^k+hlk 



feez 



feez 

< 4K2*2(m+ l)^Cfc < 4k2*2(™+ 1)8: 



feez 



(94) 



(95) 



Combining (90), (93) and (95), the lemma follows by noting that K2, K4 are dominated by 84; and Q2{'), 
^'2(0 and 5*4 (•) are all dominated by 84(-). □ 
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